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DISTANCE DISTRIBUTION OF FUNCTIONS OVER 
HAMMING SPACES 


by P. DELSARTE 


Abstract 


A theory of functions over Hamming spaces is proposed as a framework 
for the study of codes and of some generalizations. By use of the 
MacWilliams transform it is shown that the distance distribution of 
some classes of functions, including binary QR codes, are asymptotically 
normal. This extends a result of Sidel’nikov. 


1. Introduction 


A. code of length n over an alphabet S may be considered as a mapping of S" 
into the subset (0, 1) of R. In the present paper we consider more general func- 
tions f: S" — R and define their distance distribution, which reduces to the 
usual concept in the case of codes (we refer especially to MacWilliams, Sloane 
and Goethals 7), Welch, McEliece and Rumsey +°) and Delsarte ?-5)). The main 
motivation is to have a better understanding of the MacWilliams identities on 
distance distributions, and of a formal duality associated with them. 

The matter is divided into three parts. Section 2 summarizes some useful 
algebraic Properties of the association scheme defined by the Hamming distance 
over S". 

In sec. 3 we introduce and investigate the concept of distance distribution of 
functions over Hamming schemes. Roughly, the main result is the following: 
an (n + 1)-tuple of real numbers is the distance distribution of some function 
f: S" — R if and only if its MacWilliams transform is nonnegative. The com- 
binatorial interpretation of the concept of strength ^) for an arbitrary f is also 
mentioned. 

Finally, in sec. 4, we consider the problem of evaluating how much a given 
distance distribution (or its MacWilliams transform) differs from the normal 
distribution. Our results generalize a remarkable theorem due to Sidel’nikov °) 
about binary linear codes. The strength (or the distance) of the function is the 
crucial parameter in this matter. 


2. Hamming spaces 


Let S be a finite set of cardinality s = |S| > 2 and, for an integer n > 1, let 
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S" denote the set of all n-tuples x = (xj),z," over S. The Hamming distance 
d : S" X S" — R is defined as follows: 


d, y) = {ie [n]; xA y} xyes", 


where [a, b] stands for (a, a + 1, . . . , b}. It is well known that the metric space 
(S", d) has the combinatorial structure of an association scheme with n classes. 
We shall first recall some results about these Hamming schemes ?), also called 
hypercubic-type association schemes 1+). 

Using S" as row and column labelling set, we define the distance matrices 
Do, Di, ... , Dm of order s", by 


1, if d(x, y) =j, 


0, otherwise. 


Ds) =] 


The D; form a basis of an (n + 1)-dimensional commutative algebra over R. 
This algebra is semisimple and admits a unique basis (Jo, Ji, . . . , Jn) of irreduc- 
ible idempotents J,, which are pairwise orthogonal and satisfy $^ J, = I (= iden- 
tity matrix). The following relation holds: 


sz Je = Sa —z) (d + (s— 1) z7 D; (1) 


k=0 j=0 


where z is an indeterminate. This allows to express the J, in the basis of the D;, 
and conversely. 

The rank of the irreducible idempotents are given by », = rank (Jj) = 
(5 (s — 1)*. Since J, is nonnegative definite, there exists an s"x v, matrix Q, 
over R such that 


= 2,2,7, k e (0, n]. 


In particular, we may take (2, = s7"/? u, where u is the all-one vector. Then we 
construct the square on 

= [Q, 21... Qn], 
of order s". From Jy + Jı p +J = T it immediately follows that 2 is 
orthogonal : Q QT = QT Q = L 


3. Distance distribution of functions 


Let f : S" — R be any mapping of S" into the real field R. In this paper we 
shall restrict our interest to the case where the sum of all values assumed by f 
is nonzero. So, defining 


M=} (J); xes), 


we always suppose M + 0. If f maps S" into {0, 1}, then it corresponds 1-1 to 
a code of length n over S, namely the set of all n-tuples x such that f (x) = 
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In this case, we shall call f itself a code. Following Assmus and Mattson +) we 
suggest the name of a precode (= “code with repeated words") for a function f 
that only assumes nonnegative integer values. If f (x) e R* holds, for all x, then 
it seems natural to call f a fuzzy code (in the present paper it will be irrelevant 
to distinguish between two functions that are equal up to a constant factor). 

The distance distribution of an arbitrary function f, with M z 0, is the (n + 1)- 
tuple A = (Ap, 41, . . . , An) of real numbers A, given by 


A= M^? Y (Ff) fO) ix ye", dx, y) —j), (2) 


for each j e [0, n]. Clearly, 49 > 0 and Y, A, = 1 hold (in classical coding 
theory, the (n + 1)-tuple MA reduces to the distance distribution as it was 
previously called by the author ?-5)). When A; > 0 holds for all j, then we shall 
call fa pseudo-code. Thus we have the following chain with increasing generality: 
code, precode, fuzzy code, pseudo-code, arbitrary function. 

Let now C = (Co, Cy,..., Cn) be any (n + 1)-tuple of real numbers C, with 
Co 0 and » C; = 1. For a fixed real à > 1, we define the MacWilliams 
transform of C to be the (n + 1)-tuple C' = (Cy’, . . . , C,’) resulting from the 
polynomial identity 


+E D Oeo -Y060-2- Az. (3) 


k=0 J=0 


This allows to express each C,’ as a linear combination of the numbers Cj/C,. 
Notice that we have Cy Co’ = (4 + 1)7" and Y Cx’ = 1. Moreover, the corre- 
spondence C — C” is a duality, i.e. (C’)’ = C. This follows from the fact that 
the formal substitution z > u— DT- 4- Àz), , with C e C', leaves (3) invariant. 

We shall now prove an important result about the MacWilliams transform 
on distance distributions, with 4 = s — 1. The first statement is a straight- 
forward extension of a well-known theorem in coding theory ?:7:19), whereas 
the converse result really belongs to the general theory. In passing, we also point 
out that theorem 1 can be generalized to arbitrary association schemes. 


Theorem 1 


Let A be the distance distribution of a given function f: S" — R. The Mac- 
Williams transform A’ satisfies A,’ > 0, for all k. Conversely, let C = (Co, 
C,,..., C) be an (n + 1)-tuple of nonnegative real numbers C, with Co > 0 
and Y, C, = 1. Then there exists a function whose distance distribution A 
satifies A,’ = Cp. i 
Proof. (i) Let v; e R(S") be the vector whose x-component is f (x). Then (2) can 
be written as 


M? A; =v D, v5. (4) 
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Multiplying both members of (1) to the left by v," and to the right by vp, we 
obtain, by use of (3) and (4), 


n n 
Y; (vg? Jg vj) zt = Ao M? Y, Ay z*. 
k=0 k=0 
Since J, is nonnegative definite, the left polynomial has nonnegative coefficients. 
Hence A,’ > 0 holds, for all k e [0, 7]. 

(ii) We now prove the converse result. Let w, be any real vector, with v com- 
ponents, having Euclidean norm ||w,|| = VC. From the w, we construct a 
vector w, with s" components, as follows: w? = (Wo7, ..., Wa’). Let v = Q w, 
ie. QT v = w. We shall now prove that the function f defined by v, = v satisfies 
the requirements. First, using R, = s~"/? u, the reader will easily verify that 
M =u v = s"? wo = + (s" Co)? z 0 holds. Next, from (4) we deduce 


Aj Ag = ||»||-2 (wT D; v) = vf D, v, 


because lell? = llw||? = Y lw[? = Y, C, = 1. Then application of (1), 
with s = A + 1, yields 


n 


(A+ 1)" Ao Y, (97 J, v) zt = we (1 — zy (1 + åz}. 
J=0 


k=0 


Finally, using v^ J, v = ||Q,7 »||? = ||w,|? = Ci, we obtain A,’ = Cr, for all 
k, by comparison with the identity (3) applied to 4. C] 


Two functions are called dual to each other when their distance distributions 
are the MacWilliams transforms of each other. The following result is an imme- 
diate consequence of theorem 1. 


Corollary 2 


A function admits a dual if and only if it is a pseudo-code. 

Let ¢e [1, n]. A function f : S” — R with a given distance distribution A is 
said to be of distance t + lif Ay = A, =... = A, = 0, and of strength t if 
A, =A,'=...= A, = 0. The meaning of the first property is clear, at least 
for fuzzy codes : f(x) f(y) = 0 whenever 1 < d(x, y) < t. We shall now give, 
in theorem 3, the combinatorial interpretation of the concept of strength. For 
any subset T of [1, n] with iz] = t, and any t-tuple o = (@,,..., @,) e S', we 
define the real number 


AAT, o) = X (Jœ i x e$", x, = œ for allie T). 


Theorem 3 
A function f has strength ¢ if and only if A,(T, œ) is a constant (= M 577), 


+? 
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independent of T and w. 

We shall give no proof of this result, because the argument is essentially the 
same as in the particular case of codes ?:5); it is based on the following criterion: 
Q,T v, = 0 fork =1,..., t, that characterizes a function f of strength t. 
Theorem 3 when applied to precodes means that the above definition of 
“strength” agrees with the concept introduced by Rao 9) for orthogonal arrays. 


4. Generalized SidePnikov theorem 


In this section we shall compare distance distributions and their Mac- 
Williams transforms to the normal distribution 


z 


J exp (—¢?/2) dé. 


«Qu? 


N(z) 


Not surprisingly, the binomial distribution will play an important role in our 
study. Let p be any real number with 4 < p < 1, and let q = 1 — p. Then, for 
an integer n > 1, the numbers 
n\ , i 
B, = B;(n, p) = i)? q", je [0,n}, 


define the binomial distribution B = B(n,p) = (Bo, . . . , B,). In case 
p = (s—1)/s, q = 1/s, itis worth mentioning that B is in fact the distance distri- 
bution of any nonzero constant function on S". The mean y and the variance o 
of B are given by 
u-—np œ =npq. 
To any (n + 1)-tuple C = (Co, C, . . . , Ca) of real numbers C; such that 
C, 2 O0 and ), C, = 1 we associate the mapping C : R — R defined by 
C(z) = 25 (C; jje [0, n], J > po z). 
If C; > 0 holds for all j, then C can be interpreted as a probability distribution. 
For integer r > 0, let u,(C) be the rth moment of C, that is 
BAC) = X, CK(G =Y. 
j=0 


The next two lemmas, which do not really belong to coding theory, are natural 
generalizations of results due to Sidelnikov °). For the proof of lemma 4, cf. 
also Welch, McEliece and Rumsey +°). 

Lemma 4 


Suppose the MacWilliams transform C” of C satisfies C,' — C;'—...— C,—0 
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for some given r e [1, n]. Then (C) = p, (B(n, p)) holds, with p = A/(1 + A), 


for allt <r. 
Proof. Let y(€) be the characteristic function of C(z), that is 


x) = 2, (C) G&y'r! 


C; exp (if (u — j)/o). 


0 


n 


J 


Using the latter expression, we can give the MacWilliams identities (3) the fol- 
lowing equivalent form 


WE) = Co Y a*7" Cy’ (exp (ip Eo) — exp (—iq £[o))* x 
g x {g exp (i p E/o) + p exp (ig Ejc)-*. 


Expanding each term {. . .}* (. . ."7* as a power series in i£/o we obtain an 
equation like 


1 jt r n 
y(&) = Co > H5 (X a^" Frc Cy}. (5) 
ri\oa k=0 
r=0 


It can be easily verified that the real numbers F,,, satisfy F,,, > O with Fp, = 0 
when k >r and 


Fro =o u( Bin, p) 
=y) u- © 


So, equating the coefficients of &' in both members of (5), we have 


min(r,n) 


o” HKC) T Co 2 q=" Fk Ck, 


k=0 
for all r. Since Co Cy’ = q”, this clearly implies the assertion. 0 
Remark. When C is a distance distribution, then it follows from theorem 1 and 


F,,, = 0 that p C) > u(B(n, p)) holds for all r. Although this is an important 
property 1°), it is not used in the proof of theorem 6 below. 
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Lemma 5 


Let C be any probability distribution all of whose moments y,(C) exist. 
Suppose that, for given integers t and n with 2 < t < n, and for a real number p 
with 4 < p < 1, one has uL, (C) = u,(B(n, p)), for r = 0, 1, , t. Then there 
exist some positive constants a, b such that 


|C@) — NG)| < (a + b (pla)! ?) 1-47. Q) 


Proof (similar to lemma 8 of Sidel'nikov ?)). By a classical result (cf. Feller 9), 
p. 512), we have 


? 


1 F- ee £2) 24 
lc — Ne) < E recte CH dé + t Tr Opie 


-T 


where y(£) is the characteristic function of C(z) and T is any positive number. 
We mention, without giving details, the following bound for the above integral: 


f 


-T 


4T' 
(ry) 


y(&)—exp C—£?/2) 
é 


3 prat? 
LEE s.p) + (—2™ ) 
nq 


whenever r < t, r = 0 (mod 2) and T < o/(p? + q?). So the crucial point is to 
find a good upper bound for u,(B). From (6) it is possible to deduce 


epr r/2 r 1/2 


for any even r. Then the choice T = (2 r q/p e°)'/? leads to the assertion of the 
lemma, as can be easily verified. C] 


The following two theorems contain the direct applications of lemmas 4 and 
5, with p = (s — 1)/s, to distance distributions and their MacWilliams trans- 
forms (with 4 = s — 1 = p/q). 

Theorem 6 

The distance distribution C of any pseudo-code with given strength 7 satis- 
fies (7). 

Theorem 7 


Let f be any function of given distance ¢ + 1. Then the MacWilliams trans- 
form C of the distance distribution of f satisfies (7). 
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There are several interesting applications of theorems 6 and 7 in classical 
coding theory. We shall only consider a simple example: For a prime number 
m = + 1 (mod 8) let f be the extended quadratic residue code of length 
n = m + 1 over S = GF(2). It is well known (cf. for instance Berlekamp ?), 
p. 353) that f is self-dual with distance > ym. Hence it follows from theorem 6 
(or 7) that the distance distribution A of f tends uniformly to the normal distri- 
bution when m —-oo. More precisely, we have | A(z) —N l| = O(m!!^) for 
all z. 


MBLE Research Laboratory Brussels, October 1974 
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UNIFORMLY PACKED CODES 


by J. M. GOETHALS and H. C. A. van TILBORG *) 


Abstract 


A new definition is proposed for the concept of a uniformly packed 
code, which generalizes the original definition given by Semakov, 
Zinovjev and Zaitzev, and is related to the one proposed by Bassaligo, 
Zaitzev and Zinovjev. Connections with the work of Delsarte on the 
association schemes of coding theory are pointed out. Several examples 
and some open cases are mentioned. 


1. Introduction 


Uniformly packed codes were originally introduced, in the binary case, by 
Semakov, Zinovjev and Zaitzev ?!) They are quasi-perfect codes satisfying 
certain regularity conditions with respect to the distance relations. Binary per- 
fect and nearly perfect codes +°) belong to this family. Recently, a more general 
definition, which applies to any field GF(g), was proposed by Bassaligo, Zaitzev 
and Zinovjev ?). Their definition, however, appears to be so general that almost 
every code can be viewed as a uniformly packed code. In this paper, we propose 
another definition, which extends more naturally the original concept intro- 
duced by Semakov et al.?"), and reduces to the one given by Bassaligo et al.?) 
in the case of a quasi-perfect code. The paper is organized as follows. In sec. 2, 
we introduce the concept of the characteristic polynomial of a code, which is 
equivalent to the notion of minimal polynomial introduced by Delsarte 9), 
and whose degree was called by him the external distance of the code. These 
concepts play a central role in this paper. In sec. 3, we introduce the concept 
of a t-regular code, which generalizes the completely regular codes defined by 
Delsarte 5). We show in particular that a t-regular code contains 7-designs. In 
sec. 4, we discuss perfect, nearly perfect and uniformly packed quasi-perfect 
codes and show that all these codes are completely regular. In sec. 5, we intro- 
duce the concept of a jth-order uniformly packed code, and show that such a 
code is f-regular with t = d — e — j, where d is the minimum distance and 
e the error-correcting capability of the code. The paper is completed with a 


*) Department of Mathematics, Technological University Eindhoven. 
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list of those uniformly packed codes which are known to us, and some non- 
existence proofs, given in the appendix. Several open cases are also mentioned. 


2. The characteristic polynomial of a code 


2.1. Definitions 


Let F be a finite field GF(g), where q is a prime power; and let V = F” 
be the vector space of all n-tuples over F. The weight w(u) of a vector ue V 
is the number of its nonzero components, and for any two vectors u, v e V, 
their Hamming distance d(u, v) is the weight of their difference: 


d(u, v) = w (u — v). 


A q-ary code C of length n is any subset of V, with |c] > 2; its minimum 
distance d(C) is the minimum value of the distance between any two distinct 
elements of C. The distance enumerator of a code C is the formal polynomial 


Ac(z) = |c|-+ Z gd 


u,veC 


Thus, the coefficient A, of z! in its expansion Ac(z) = », A,z', is the average 
over all code vectors of the number of code vectors at distance i from a given 
code vector. We observe that the A, are rational numbers satisfying 4o = 1, 
and by A, — |c]. A code C is said to be distance-invariant if, given a code 
vector u, the number A,(u) of code vectors at distance i from u is a constant 
not depending on u for every i, 0 <i <n, in which case we have A; = A,(u), 
an integer. The weight enumerator of a code C is the formal polynomial 


Wz) = Y, z". 


uec 


Thus, the coefficient of z' in its expansion equals the number of code vectors 
of weight i. For any distance-invariant code C containing the all-zero vector, 
its distance enumerator is identical to its weight enumerator. 


2.2. Codes as elements in a group algebra 


The vector space V has, with respect to vector addition, the structure of an 
elementary Abelian group of order q”, of which the g"-dimensional group 
algebra C(V) over the complex field C can be described as follows. To each 
element ue V we associate a basis element e(u) of C(V). Every element of 
C(V) is uniquely expressed as a linear combination 5, c(u) e(u) of these q” 
basis elements, with complex coefficients c(u). The product e(u) « e(v) of the 
basis elements e(u), e(v), is defined by e(u) + e(v) = e(y), with y=u + vin 
V. 'This induces the following multiplicative rule in. C(V): 
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(x B(u) a) $ b eu) eo) - z( Y Bw) x) (6. 0 


y \utvsy 


To any subset S C V we associate the element 


S= Y elu) 


.ueS 
in C(V), for which we use the same symbol. In this way, a code C can be 
viewed as an element of C(V). The following subsets, 


Y, = (ue V | w) = i}, i=0,1,...,7, 


considered as elements of C(V), form the basis of an (n + 1)-dimensional 
subalgebra, cf. Van Lint !^). For a code C, we have, according to the multi- 
plicative rule (1), 


Y,* C = 3, fu) eQ), 


where f;(u) is the number of code vectors at distance i from v. Similarly, with 
S; defined by 


J 
S=}, Y, 
i=0 (2) 
we have 
S,* C = Y, gu) elu), (3) 
where 


J 
g(u) = J, fi) 
i=0 


is the number of code vectors at distance j or less from u. For any j, with 
2j< d(C), we have g;(u) = 0 or 1 in (3), since any two distinct code vectors 
are at least at distance d(C) from each other. Hence, S; « C "represents" in 
this case the subset of V consisting of the fel disjoint spheres of radius j centred 
at the code points. 


2.3. Krawtchouk polynomials 


Let y be any nonprincipal character of the additive group of GF(g). Then, 
to every ue V we associate a character y,: V — C defined by 


Xu?) = x(u, v), Yver, 


where (u, v) denotes the inner product in V. We now extend each y, to a 
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linear function from C(V) to C by . 


ap c(v) «9)) = }, c(v) x). 
We shall make use of some properties of these linear functions, which we state 
without proof in the following lemma. 


Lemma 1 


O VA,BeC(V), x(4 * B) = xA) x.(B). 
Gi) For any we V with w(u) — j, we have 


© XY) = PG), 
where P,(x), a polynomial in x, is formally defined by 


LP z = (1 + @—1) z)""* (1 —2)*. (4) 


n 
(iii) S, = Y, Y, is the unique element of C(V) satisfying 
i=0 


Xo(S$,) = 4", x($) = 0, Vu #0. 


For a proof of these properties, we refer for example to Van Lint 14). 


The Krawtchouk polynomials are the polynomials P,(x) defined by (4), for 
which the following expression can be obtained, for i= 0, 1, ..., n, 


i 


n—kN (x 
ra= 3 a-d C (D (5) 
k=0 
x 
om 
0 


()-se-n..e—&* 1)/k!, 


where 


and 


which clearly shows that P,(x) has degree i. They satisfy the orthogonality 


relation 
: n n 
x " (q — 1)* PK) Pk) = ôi, (^) (q — 1) q", 


k=0 
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which, by use of the identity 


n n 
( ) (a — 1} PK) = ( ) (g — 1)! P.C), 
k j 


simplifies to 


Y Pk) PG) = 61,59". (6) 


k=0 


Other properties may be found in the classical book by Szegó 2°). 
For a code C, let us define, for j = 0, 1, ..., n, the n + 1 real numbers 
B,, which we call the characteristic numbers of C, by 


B, = |cr* X opp. 


weY J (7) 
Let further N(C) be the set of integers 
N(C) = (j|1 <j «n, B, #0}, (8) 
and let Fc(x) be the polynomial 
Fe(x) = 4"|C|-* TI 0 — xh). (9) 
JeN(C) 


Since, apparently, the Krawtchouk polynomials form a basis of the set of 
polynomials of degree less than or equal to n, Fc(x) is uniquely expressible as 


Fe(x) = X}, a, P.Q9. (10) 


i-o 
The polynomial (9) will be called the characteristic polynomial of the code, and 
(10) its Krawtchouk expansion. We can now state the main result of this section. 
Theorem 1 


Let a, &,, ..., «, be the coefficients of the Krawtchouk expansion of the 
characteristic polynomial F(x) of a code C. Then, as an element of C(V), the 
code C satisfies 


9X7 Y,*C- Sn- (11) 


i=0 


Proof. Let us consider the value any given linear function y, assumes on the 
left-hand side of (11). By lemma 1 (i), we have 


(Sa n«c)-n(x« r.) x40) . (12) 
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Let u have weight j. Then, by lemma 1 (ii), we have 


«(E ay n) T 2 a, Pj) = Fc), 


and, from the definition of the characteristic polynomial, cf. (7), (8) and (9), 
it follows that, for any u + 0, if y,(C) z& 0, then Fe(j) = 0. Now, for u = 0, 
we have xy,(C) = icl, and F,(0) = q" |c|-*, from which it follows that the 
value of (12) is zero for any u 4 0, and q” for u = 0. Hence, by lemma 1 (iii), 
the theorem is proved. 


Corollary 1.1. Let a, &,, ..., %, be the coefficients of the Krawtchouk ex- 
pansion of the characteristic polynomial of a code, and let /,(u) denote, for 
any u € V, the number of code vectors at distance i from u. Then, 


Vue, Sarsi (13) 


i=0 


Proof. Let }, c(u) e(u) be the element defined by (11) in C(V). Then, we have 
c(u) = Y, a, f(u) in the left-hand side, and c(u) = 1 in the right-hand side. 


Remarks. The results of theorem 1 and corollary 1.1 still apply if the charac- 
teristic polynomial Fe(x) is replaced by any polynomial F(x) = F, a; P(x) 
divisible by F(x), and satisfying F(0) = q"|C|-!. The above results were 
already implicitly obtained by Delsarte 5*6), who called Fo(x) the minimal 
polynomial, and its degree s the external distance, of the code C. The following 
corollary, due to Delsarte 5), clearly shows the reason for calling s the external 
distance. 


Corollary 1.2. Let the characteristic polynomial of a code C have degree s, 
and let o(C) be the covering radius (or true external distance) of the code C, 
that is 
o(C) = max {min d(u, v)}. 
ueV vec 


Then 
e(C) <s. 
Proof. From corollary 1.1, we have 


x: a, fi(u) = l1. 


i-o 
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Now, suppose o(C) > s. Then, there exists ue V, such that d(u, v) > s, for 
all ve C, whence f,(u) = 0, for i = 0, 1, 2, ..., s, and 


T ufi) = 0, 


i=0 


contradicting (13) and proving the corollary. 


2.4. The MacWilliams identities 


Let C bea linear code, that is a subspace of V, and let CL, its dual (or orthog- 
onal) code, be defined by 


CL = {ue V | (u, v) = 0, Y ve C}. 
Then, 
lc| if uec, 


KC) = 
xC) 0 otherwise , 


(14) _ 
from which it follows that B;, defined by (7), equals the number of vectors of 
weight j in Ct. Hence, the distance enumerator of the code C1, which is linear, 
hence distance-invariant, is the polynomial 


Be(z) = 5, B, z’. 
1 


Theorem 2 


The distance enumerators Ac(z) of a linear code C, and Bc(z) of its dual, 
are related by (the MacWilliams identity) 


Ac) = a7"|c| X 8,0 —2! 0 + (4—1) 2777. (15) 
J=0 


This identity, first proved for linear codes by MacWilliams !?), can be ex- 
tended to arbitrary codes in the following way. Let, for an arbitrary code C, 
its n + 1 characteristic numbers B,, j = 0, 1, ..., n, be defined by (7). Note 
that, by definition, we have B, = 1, and Vj 4 0, B, Æ 0 iff Fe(j) = 0, where 
F(x) is the characteristic polynomial of C. It is therefore apparent that the 
number of distinct nonzero characteristic numbers is equal to s + 1, where s 
is the degree of F(x). 


Theorem 3 


The distance enumerator A,(z) of a code C is uniquely determined from its 
s + 1 nonzero characteristic numbers by means of the MacWilliams identity 


(15). 
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The above theorem is a generalization of a result obtained by MacWilliams 
et al.19) on the weight enumerator of a code, and can be proved along the same 
lines. We shall omit the proof, and since we shall need it, we recall below the 
result obtained by MacWilliams et al.19). l 


Theorem 4 
For a code C, let the n + 1 real numbers C, j= 0, 1, .. ., n, be defined by 
C; = |er* È xC). (16) 
ueYy 


Then, the weight enumerator W,(z) of C is given by (the Mac Williams identity) 
Wee) —47"|c|x 60—2'a + 6-02". | qo» 
J=0 


Remarks. (i) Apparently, if the characteristic number B, defined by (7) is zero, 
we have y,(C) = 0, Vue Y;, whence C; = 0. Hence, the number of distinct 
nonzero C, is at most equal to s + 1. 

. (ii) The reader may have the impression that C, defined by (16) is, in general, 
a complex number. However, the equivalent expression 


C, T jej: > AAV) 
vec 
clearly shows (cf. lemma 1) that C, is a real number. 
(iii) It should be clear from (14) that, for a linear code C, we have C, = Bj, 
which equals the number of vectors of weight j in Ct. 


3. t-regular codes 


In the preceding section, we have introduced the concept of the characteristic 
polynomial Fc(x) of a code C, and we have shown (cf. corollary 1.2) that its 
degree s, called the external distance of C, is at least equal to the covering 
radius o(C) of the code. It should be clear that o(C) > 4 (d — 1), where 
d= d(C), the minimum distance of C. Hence, d < 2s + 1. In this section, 
we shall be concerned with codes having the property d > s, that is, 


s<d<2s+1. (18) 


Codes having this property are interesting as they satisfy certain regularity con- 
ditions with respect to the distance relations in the space V. In particular, we 
have the following theorem, due to Delsarte >), of which we omit the proof. 


Theorem 5 


Let the external distance s and the minimum distance d of a code C satisfy 
s <d. Then, C is distance-invariant. 
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For a given code C, and a given v € V, let D(v) denote the code 
D(v) = {u—v| ue C}, (19) 


obtained from C by translation. It should be clear that all D(v) have the same 
distance enumerator and the same characteristic numbers. However, their weight 
enumerators are, in general, different. An important question is whether, for 
distinct u, v e V, the corresponding D(u), D(v) have distinct weight enumera- 
tors. In that respect, it is interesting to quote the following result, which can 
be obtained from ref. 5, theorem 3.2. 


Theorem 6 


Let s be the external distance of a code C. Then, its weight enumerator 
Welz) = Y, c; z! is uniquely determined from its first s components co, c, 
MEDIO 
Proof. Let Fc(x) be the characteristic polynomial of C, of degree s, and let 
N(C), cf. (8), be the set of its s distinct zeros. By theorem 4, the weight enumera- 
tor W,(z) is uniquely determined from the real numbers C, defined by (16), 
where Co = 1, and, for any j 4 0, C, #0 only if je N(C). From (17), and 
by use of (4), we obtain the following expression for the coefficients of 
Wz) = 3 c, z': 


e, — 47" |C| X C; PG), ij — 0, 1,...,n. 
Jj 


Hence, for i= 0, 1,..., s— 1, we have 


È GPU) = |c|-* c, — P,(0), (20) 
JeN(C) 
a system of s linear equations in the s unknowns Cj, je N(C). It is now a 
simple matter to show that this system has a unique solution. Let k be any 
given element of N(C), and let F,(x) be the unique polynomial of degree 
s—1 satisfying F,(k) = 1, and F,(j) = 0, for any j Æ k, je N(C). Then, if 
F,(x) has the Krawtchouk expansion 


1 


F(x) = X bi P(x), 
1-0 


we easily obtain from (20): 
s—1 
C, = q” Ier: È, bi ci — FQ). 
i=0 


Hence, the nonzero C; are uniquely determined from the first s coefficients of 
the weight enumerator, and, by theorem 4, this completes the proof. 
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Let us now consider any D(v), defined by (19). It has the same external 
distance s as C. On the other hand, in its weight enumerator 


W() = LA) z' 


the coefficient f,(v) is equal to the number of code vectors u e C at distance i 
from v. Hence, according to theorem 6, the weight enumerator W(z) is uniquely 
determined from the s components fiw), i = 0, 1, ..., s— 1. Let now w be 
the minimum weight of D(v), that is the smallest integer w such that f, (v) 4 0, 
and fi(v) = 0 for i —0,1,...,w— 1, in the weight enumerator of D(v). Note 
that, necessarily, w < 5. 

For an integer t, with O <t < s, we shall say that, a code C is t-regular if, 
for every v e V such that D(v) has minimum weight w < £, the weight enumera- 
tor of D(v) depends only on w. Note that a zero-regular code is distance- 
invariant, and that an s-regular code is completely regular, in the sense of 
Delsarte 9). 


Theorem 7 


(i) If the minimum distance d and the external distance s of a code C satisfy 
s «d « 2s — 1, then C is (d — s)-regular. 

(ii) If d > 2s — 1, then C is completely regular. 
Proof. Let v € V be any vector such that D(v), defined by (19), has minimum 
weight w < d — s, and let x e D(v) have weight w. Then, since the minimum 
distance in D(v) is d, any other vector y e D(v) has weight at least equal to 
d — w Z s. It follows that the first s components f,(v), i = 0, 1, ..., s— 1, 
of the weight enumerator of D(v) all are equal to zero, except for f,(v) = 1. 
Hence, by theorem 6, the weight enumerator is uniquely determined from 
these s components which, clearly, depend only on w. This proves the first 
part of the theorem. Let us assume now d > 2s— 1, that is d—5s >s—1. 
Then, from the above discussion, it clearly appears that C is (s — 1)-regular, 
-but for any D(v) with minimum weight w = s, we have, by definition, 
fv) = 0 for i20, 1, ..., s— 1, and, by theorem 6, the same conclusion 
holds on its weight enumerator. Hence, in this case, C is completely regular, 
and the theorem is proved. 


Remark. The above theorem was already obtained by Delsarte, cf. ref. 6, 
theorems 5.11 and 5.13. 


Binary ¢-regular codes are interesting as they contain ¢-designs, as we shall 
see. It is perhaps worth noting that many recently discovered t-designs were 
found in t-regular codes. Let us recall that a t-design S,(t, k, n) is a collection 
of k-subsets of an n-set, which are called blocks, such that every t-subset is 
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contained in precisely A blocks. We shall identify a binary vector of length n 
and weight k with the k-subset of its nonzero coordinate places. The following 
theorem is a slight variation of theorem 5.7 of Delsarte 5). The proof is quite 
similar and will be omitted. l 


Theorem 8 


Let C be a t-regular binary code of length n, containing the all-zero vector, 
and let its minimum distance d satisfy d > 2t. Then, for each weight k, the 
collection of code vectors of weight k defines a t-design. 


A similar result holds in the g-ary case, which involves the concept of a q-ary 
t-design, to be defined as follows. We shall say that a vector x e V is covered 
by we V if, for every nonzero component x, of x, the corresponding com- 
ponent u; of u has the same value, that is 


Vx, 40, u = x, 


Then, a collection SS of vectors of weight k in V is said to form a q-ary t-design 
D,(t, k, n) if every vector x e V of weight t is covered by precisely u members 
of S. 


Theorem 9 


Let C be a t-regular q-ary code of length n, containing the all-zero vector, 
and let its minimum distance d satisfy d > 21. Then, for each weight k, the 
collection of code vectors of weight k defines a q-ary t-design. 

The full proof of this theorem is rather long and will be omitted. We shall 
only give a sketch of the proof, which proceeds by induction on k. We first 
show that the theorem is true for k — d, the minimum distance of C. Let 
veéV be any vector of weight ¢, and let D(v) be the translated code (19). 
Since d > 2t, D(v) has minimum weight 7, and since C is t-regular, the weight 
distribution of D(v) depends only on f, and not on v. Hence the number of 
vectors of weight w — t in D(v) is a constant depending only on w and f. But, 
any x e D(v) is of the form x = u — v, with ue C. Hence, for any x e D(v) 
with weight w(x) = w — t, the weight of the corresponding ue C satisfies 


w(u) < w(u — v) + w(v) = w, 


with equality only if v is covered by u. On the other hand, we have w(u) > d, 
for any u Æ 0, ue C. Therefore, the number å of vectors of weight d — t in 
D(v) is precisely equal to u(v), the number of code vectors ue C of weight d 
and covering v, or to u(v) + 1, when v itself has weight d — t, that is when 
d = 2t. In any case, u(v) is a constant depending only on d and t£, and this 
proves that the vectors of weight d in C form a ¢-design D,(¢, d, n). Then, 
the proof is completed by showing that, if the result is true for all k < w, 
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it is also true for k = w. To that end, it suffices to show that, under the 
induction hypothesis, the number of code vectors u e C of weight k < w, such 
that u —v e D(v) has weight w — 7, is a constant depending only on k, w 
and t. It then follows that the number of code vectors u e C of weight w, such 
that u — v has weight w — 1, that is covering v, is a constant since u — v has 
weight w — t only if w(u) < w. 


Remark. 'The above result was already mentioned for the case when k — d in 
theorem 5.9 of Delsarte 5). 


4. Perfect, nearly perfect, and uniformly packed quasi-perfect codes 


A code C is said to be e-error-correcting if its minimum distance d is equal to 
2e-- 1 or 2e+2. For any e-error-correcting code C, and for any vector 
v € V, there is at most one code vector ue C such that d(u, v) < e. An e- 
error-correcting code is perfect if, for every ve V, there exists exactly one 
code vector at distance e or less from v. 


Theorem 10 


An e-error-correcting code C is perfect iff, as an element in C(V), it satisfies 
Se * C = Sp 
Proof. It suffices to observe that in 


Se * C —, g«(v) e(v), 


veV 


cf. (3), the coefficient g,(v) is equal to the number of code vectors at distance e 
or less from v. Hence, an e-error-correcting code is perfect if, and only if, 
gv) = 1, V ve V, which proves the theorem. 

This theorem has a number of consequences, cf., for example, Van Lint !*). 
We shall only mention (without proof) the following corollaries. 


Corollary 10.1. The characteristic polynomial of a perfect e-error-correcting 
code is the Lloyd polynomial !9) L,(x) of degree e, defined by the Krawtchouk 
expansion 


L(x) = X, Pi). (21) 
i-0 


Corollary 10.2. A perfect code is completely regular. 


Apart from the infinite family of single-error-correcting g-ary codes, which 
exist for any prime power q and every length n of the form n = (q™ — 1)/(q — 1), 
and the trivial binary repetition codes consisting of two code vectors of length 
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n = 2e -}- 1 at distance n from each other, only two other perfect codes are 
known, namely the two Golay codes: the ternary double-error-correcting code 
of length 11, and the binary triple-error-correcting code of length 23. It was 
recently shown that there are no other g-ary perfect codes ?*), and that codes ^ 
with the Golay parameters are unique 8). 

An e-error-correcting code C, which is not perfect, is said to be quasi-perfect, 
if every v e V is at distance e + 1 or less from at least one code vector, that is, 
if its covering radius g(C) = e + 1. Let C be a quasi-perfect e-error-cor- 
recting code, and let v e V be at distance e + 1 from some code vector ue C 
Then, if the minimum distance d(C) is 2e + 2, v is at distance e +- 1 or 
more from every code vector. But, if d(C) = 2e + 1, v can be at distance e 
from some other code vector, or be at distance e + 1 or more from every 
code vector. In general, the number of code vectors at distance e + 1 from 
v depends on v. 

We shall say that an e-error-correcting code C is a uniformly packed quasi- 
perfect code with parameters A, u, if, for every v e V at distance e from some 
code vector, the number of code vectors at distance e + 1 from v is a con- 
stant A, and if, for every ve V at distance e+ 1 or more from every code 
vector, the number of code vectors at distance e + 1 from v is a constant u. 
Note that 2 = 0 iff d(C) = 2e + 2. 


Theorem 11 


A quasi-perfect e-error-correcting code C is uniformly packed with param- 
eters A, u, iff, as an element in C(V), it satisfies 


(u Se— À Ye + Yer1)* C= M Sn Q2) 
Proof. The set D of ve V at distance e + 1 or more from every code vector 
is represented by the element 
D=S8,—S,* C (23) 
in C(V). On the other hand, in 
Yer1 * C = , c(v) e(o), 


veV 


the coefficient c(v) is equal to /,.,(v), the number of code vectors at distance 
e-+1 from v. Thus, according to the definition given hereabove, C is uni- 
formly packed iff we have f,,.,(v) = y, V v e D, and f,, (v) = A, for every v 
in the set 
T{C) = {ve y| 3ueC, d(u, v) = e}, 


represented by Y,* C in C(V). Hence, C is uniformly packed iff 
You # C= AY, 4 C+ uD. (24) 
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Now, from (23) and (24), we easily obtain (22), which proves the theorem. 
Theorem 12 


An e-error-correcting code C is a uniformly packed quasi-perfect code with 
parameters A, yw, iff its characteristic polynomial is the polynomial of degree 
e + 1 defined by the Krawtchouk expansion 5, a, P;(x) with coefficients 


€,4, = l/4, ae = 1—Afu, and a, =1 for 0O<i<e—1. (25) 


Proof. Let the characteristic polynomial of C be the polynomial of degree 
€ + 1 defined by the Krawtchouk expansion Y a, P,(x) with coefficients (25). 
Then, C has external distance s = e + 1, hence is quasi-perfect. On the other 
hand, by theorem 1, as an element in C(V), it satisfies 


2, u Y, * C = Sp 


with the coefficients (25), which implies (22). Hence, by theorem 11, C is 
uniformly packed, which proves the “if” part of the theorem. Let us assume 
now that C is a uniformly packed quasi-perfect e-error-correcting code. Then, 
e(C) — e + 1. Hence, from corollary 1.2, the external distance s satisfies 
s >e + 1. Now, let us consider the polynomial F(x) of degree e + 1 defined 
by the Krawtchouk expansion 


F(x) = >: a, P(x), 


with coefficients given by (25). Then, from (22), we easily deduce that, for 
. every u of weight j = 0 such that y,(C) = 0, we have F(j) = 0. Hence, F(x) 
is divisible by the characteristic polynomial Fe(x) of degree s, cf. (9), which 
implies e + 1 >s, whence e + 1 = s, and F(x) =k F,(x), for some con- 
stant number k. Finally, from F(0) = q" igs which follows from (22), we 
deduce k = 1, that is Fc(x) = F(x), hence proving the theorem. 


From the above theorem we obtain the following necessary condition for 
the existence of a uniformly packed quasi-perfect code with parameters 4, u. 
The reader will observe the similarity with the condition obtained by Lloyd !5) 
for perfect codes. 


Corollary 12.1. If a q-ary quasi-perfect e-error-correcting code C of length n 
exists, which is uniformly packed with parameters A, u, then the polynomial 
F(x) defined by the Krawtchouk expansion 
e-1 
F(x) = X, Pix) + (1 — Alp) Po(x) + (9) Pes 1(X), (26) 


i-o 


has e + 1 integral zeros in the interval [1, n], and |C| F(0) = q" holds. 
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Proof. 'The result follows immediately from theorem 12, by definition of the 
characteristic polynomial of a code. 


Corollary 12.2. A quasi-perfect uniformly packed e-error-córrecting code is 
completely regular. 

Proof. This follows immediately from theorem 7 (ii), since s = e+ 1 by 
theorem 12, and d > 2e + 1 by definition. 


Corollary 12.3. In a q-ary quasi-perfect uniformly packed e-error-correcting 
code C, containing the all-zero vector, the code vectors of weight k define, 
for any k, a q-ary t-design, with t = eif d = 2e + ljort = e + lifd —2e + 2. 
Proof. 'This follows immediately from theorem 9, since C is completely regular, 
hence (e +- 1)-regular, and a fortiori e-regular, by corollary 12.2. 


Quasi-perfect uniformly packed binary codes, for which 4+ 1 = u, were 
introduced by Semakov et al.?1), who showed that, for an e-error-correcting 
code of length n, the following inequality holds: 


(e 4-1)A4 in—e. (27) 
Binary codes for which the following equality holds, 
At+l=p=[@4+ DK + 1), 


where [x] denotes the integral part of x, were called nearly perfect by Goethals 
and Snover +°), as they satisfy with equality a specialized version of the sphere- 
packing bound, due to Johnson !?). 


In a recent paper, Bassaligo et al.?) proposed a more general definition of 
uniformly packed codes, which, for quasi-perfect codes, is equivalent to ours. 
In particular, they showed that, in the g-ary case, the following inequalities 
hold, which generalize (27): 


(e - 1) 4 & (a —1) — e), (e+ 1) & & (a — 0) n. Q8) 


Clearly, the definition of a nearly perfect code can be extended to the g-ary 
case. We shall say that an e-error-correcting g-ary code is nearly perfect if it is 
uniformly packed with parameters A and y satisfying 


À = [(n— e) (a — 0Ke + 2], »& — [n (a — De + 01. 


Note that a perfect e-error-correcting code can be viewed as a nearly perfect 
code with parameters 


A=(n—e)(Q—)Ie+), w=A+q—], (29) 
although, by theorem 10, we have D = 0 in (23), whence y is irrelevant in (24). 
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However, it can be shown that it satisfies (22), with 2 and u given by (29). 
Apart from perfect codes, the following nearly perfect codes are known 1°): 
(i) the shortened binary single-error-correcting codes, which exist for any 
length n of the form n = 2" —2,m z3; 
(ii) the binary nonlinear double-error-correcting codes, discovered by 
Preparata ?°), which exist for any length n of the form n = 4" — 1, m > 2. 
It is still an open question whether other nearly perfect codes do exist. 
There exist, however, many uniformly packed quasi-perfect codes. Most of 
them are obtained by application of theorem 12 to linear codes C. It suffices 
to observe that, for a linear code C with dual code Ct, the characteristic poly- 
nomial of C is identical to the annihilator polynomial of Ct, defined as follows. 
For an arbitrary code D, the annihilator polynomial of D is the polynomial 
Gp(x), defined from the set 


M(D) = {j | 3 u, v e D, dtu, v) =j #0} (30) 
by 
Gy) = |D| TI G — xl). (31) 
jeM(D}) 


Comparing (8), (9) to (30), (31), we easily deduce that, for a linear code C, we 
have Fc(x) = Ge1(x). Hence, from theorem 12, we obtain the following corol- 
laries. 


Corollary 12.4. A linear e-error-correcting code C is a uniformly packed quasi- 
perfect code, with parameters A, u, iff the annihilator polynomial of its dual 
code C has the Krawtchouk expansion (26). 


Corollary 12.5. A linear single-error-correcting code is quasi-perfect and uni- 
formly packed iff its dual is a two-weight code. 

Proof. Let C be a linear single-error-correcting code of length n, and let its 
dual C+ be a two-weight code, that is, having a weight enumerator of the form 


Wolz) = 1+ N,z" + N3 22, (32) 


where only two weights, w,, W2, occur with a nonzero coefficient. From the 
MacWilliams identity (17), and by using the fact that d(C) > 3, we deduce 


1+N,+N, = 9"|C|-}, 
and 
P) + Ni Piw) + N; P(w;) = 0 for i= 1,2. 


From these equations, it follows that, in the Krawtchouk expansion 
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F(x) = 2 a, Pœ) 
i=0 


of the characteristic polynomial of C, satisfying, by definition, 
F,(0) = q" lal=s, Fc(wi) x 0, for i— l, 2, (33) 


the coefficient «o = 1. The other coefficients are easily obtained from the def- 
inition (33), and can be given the form a, = 1/4, a, = 1 — A/u, with 


24 = (n— 1) (a — 1) + Pid) + 1) (Pid) + 1), 
2u = n (q — 1) + P (w1) Pi(w2). 


Hence, by theorem 12, the corollary is proved. 


Linear two-weight codes, with a single-error-correcting dual code, were 
studied by Delsarte *), who showed that they are related to certain strongly 
regular graphs (that is, association schemes with two classes), and called them 
two-weight projective codes. He also showed that, from a two-weight projective 
code of length n and dimension k over some extension field GF(qg”) of GF(q), 
a two-weight projective code of length n (g" — 1)/(g — 1) and dimension k m 
over GF(q) can be obtained. Note that a two-weight projective code C, of 
length n and dimension k over GF(q), with weights w,, w2, is in a one-to-one 
correspondence with an n-subset S of the set P of points of a (k — 1)-dimen- 
sional projective space PG(k — 1, q), having the property that the intersection 
of S with any hyperplane H C P consists of n —w, or n — w, points. Ap- 
parently, a similar property holds for the complementary set S” = PAS, thus 
defining a k-dimensional two-weight projective code C', of length 


n' = (g* — (a — 1) —n, 


called the complementary projective code of C. Note that the weights in C and 
C’ are related by w, + w = q*^!, with N; = Nj, for i — 1,2, in their re- 
spective weight enumerators (32). It also follows from Delsarte 9), cf. sec. 2.6, 
that, to any pair of complementary two-weight projective codes of dimension 
k, there corresponds, by duality, a pair of complementary two-weight projective 
codes of the same dimension k, with respective lengths N,/(q—1) and 
N./(q—1), where N, and N, are the coefficients appearing in the weight 
enumerators (32) of the original pair. 

We shall now briefly indicate some of the uniformly packed quasi-perfect 
l-error-correcting codes which can be obtained from two-weight projective 
codes, by corollary 12.5. The results are summarized in table I, where, for 
each code, or family of codes, referred to by a number in the first column, we 
list the parameter q, referring to the field GF(q), the length n, the dimension k 
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TABLE I 


Uniformly packed quasi-perfect 1-error-correcting codes 


r(q™—1)/@—1) 
rq” + 1)/(q — 1) 
2" 2 
22m —] 
2-1 (29 — 1) 
(27-1 -0Q"-4 1) 
(2^ — 1) (22"—1) 


(q" — a) (a — 1) 


2m 
2m 


C2) +4@"—9) 
C2) —3 (q" +49) 
0 


23m—-1 HRS 22m ive 2m + 3 
Q""*—D)Q""—10Q"-4H1 
2m-2 Q"71 + 3) (2™ - 1) 
23m-1 — 22m A 2m-1i + 2 

0 
18 
40 
99 

0 

261 

22 
765 
155 
396 


(n — 2) (q — 1)/2 


C2) 
2m™-1 41 
Q=: DOT — 1) 
2m-2 (2m-1 — 1) (2" — 1) 
2m-2 Q"71 + 1) (2™ + 1) 
(2"-1 = 1) (22™ "Te 1) 
2m-1i 
30 
30 
100 
10 
253 
18 
TIO 
132 
420 


n (q — 1)2 


remarks 


(1) 
Q) 
(3) 
(3) 
(3) 
3) 
(3) 
(4) 
(5) 
(5) 
(5) 
(6) 
(6) 
(7) 
(7) 
(7) 
(7) 


9c 
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of the dual two-weight projective code, and the parameters A and u; numbers 
appearing in the last column refer to the following remarks. 


Remarks. (1) It was pointed out by Delsarte *) that two-weight projective codes _ 


of length r and dimension 2 over GF(q") exist for any r in the range 
2<r<q"—l1. 


Hence, the codes of family No. 1 exist, under the same assumptions, for any 
prime power g, and any m > 1. Note that the codes obtained by complemen- 
tarity and by duality belong to the same family. 

(2) The codes of family No. 2 are obtained from the two-weight projective 
codes associated to certain negative Latin-square graphs NL,(q"). They exist for 
a prime power q when r = q"^7! or q"-! — 1, with m z 2, and also when 
r = (q" — 1)/t or (t — 1) (g" — 1)/t, where m = 2', for some integer i > 1, 
and t Æ 1is any divisor of q + 1, cf. Delsarte 7). Note that, as in the preceding 
family, the codes obtained by complementarity and by duality belong to the 
same family. For the sake of completeness, let us mention that some of the 
two-weight codes discovered by Baumert and McEliece ?) belong to this family. 

(3) The families No. 3 and No. 5 are obtained from the two-weight codes 
discovered by Delsarte’), and No. 4 and No. 6 from their complementary 
codes. These codes correspond by duality. From any of them, a binary code 
can be obtained; for example, family No. 7 is obtained from No. 4. These 
codes exist for any m z 2. 

(4) These codes are binary extended perfect l-error-correcting codes. They 
exist for any m z 2. 

(5) Codes Nos 9, 10 and 11 can be obtained from the ternary perfect 2-error- 
correcting Golay code C, of which the dual C. is a two-weight projective code 
of dimension 5. Code No. 11 is obtained from the complementary code of Ct, 
and codes No. 9 and No. 10 are obtained by duality. For more details, we 
refer to Delsarte 7). 

(6) These codes are obtained from the complementary two-weight projective 
codes defined by the 56-cap in PG(5, 3) discovered by Hill +”). They belong 
to family No. 2, for q — 3, m — 3, and r — 4 (No. 12), or r — 22 (No. 13). 

(7) These codes are obtained from the binary perfect 3-error-correcting 
Golay code. Details for the construction of code No. 14 can be found in 
Delsarte 7). The other codes (Nos 15, 16, 17) are obtained by complementarity 
and by duality. 

(8) The codes of family No. 18 are obtained by shortening a perfect single- 
error-correcting code of length (g" — 1)/(g — 1). For q = 2, they are nearly 
perfect 1°), 


The following set of parameters, 
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q—2,n = Q?" — 1/3, A=0, p = (22"-? — 13, 


with m > 3, was found admissible for a uniformly packed quasi-perfect 1- 
error-correcting code. However, we shall show that no code with these param- 
eters can exist (cf. appendix A). 


We conclude this section with a brief description of those uniformly packed 
quasi-perfect e-error-correcting codes, with e >1, which are known to us. 
These results are summarized in table II, where, for each code, or family of 


TABLE II 


Uniformly packed quasi-perfect e-error-correcting codes 


Q?" —4)/3 
Q?" — 4)/3 


(22™" — 1)/3 
(22m — 1)/3 
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codes, referred to by a number in the first column, we list the parameter q, 
referring to the field, the error-correcting capability e, the length n, and the 
parameters A and u. The codes defined in No. 1 are the Preparata ??) codes, 
which are nearly perfect +°) and exist for any m > 2. As family No. 2, we 
have the BCH codes of length 22"7*! — 1, and designed distance 5, which 
are known to be quasi-perfect +1), and can be shown to be uniformly packed 
by application of corollary 12.4, for any m > 2. Code No. 3 was shown by 
Van Lint 15) to be uniformly packed. Finally, codes Nos 4, 5 and 6 are ob- 
tained by puncturing or extending the Golay binary code of length 23, and 
codes Nos 7, 8 and 9 are obtained similarly from the Golay ternary code of 
length 11 *). We should like to mention that codes Nos 1, 2, 4 and 5 were 
already mentioned by Bassaligo et al.?), who also obtained, by a computer 
search, the following admissible sets of parameters: 


*) Note that codes Nos 7 and 8 are actually 1-error-correcting codes. 
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q =2, e =2, n = 70, 2 = 16, u = 10, 

q =4, e =2, n = (2?™+1 4 1)/3, y = à + 1 = Q?"—1y3, for mze2. 
It is still unknown whether codes with these parameters do exist. In appendix B, 
we shall show that codes numbered 1, 2, 3 in table II are the only binary uni- 


formly packed quasi-perfect 2-error-correcting codes, with parameters 4, u, 
satisfying À -++ 1 — p. 


5. Uniformly packed codes of jth order 


In a recent paper, Bassaligo et al.?) proposed the following definition: 
“A code C, with covering radius o(C) = T, is uniformly packed, if there 


exist real numbers «o, 04, ..., Ær, such that, for every ve V, 
T 
d afr) =1 
i-0 


holds, where f;(v) is the number of code vectors at distance i from v." 

If, in the above definition, we replace the covering radius o(C) by the ex- 
ternal distance, we may consider, in view of corollary 1.1, that every code is 
uniformly packed. For that reason, we shall propose a different definition, 
which extends more naturally the original one given for uniformly packed 
quasi-perfect codes. 

Let C be an e-error-correcting code with minimum distance d = 2e + 1 or 
2e + 2, and let D be the set 


D= {ve V| dtu, v) >e +1, Vue Ch. 
For an integer j, with 1 <j < d — 1 — e, let T; be the set 
T,— (veV|3ueC, d—6é—j &d(u v) <e}, 
and, for any v e V, let R,(v) be the set 
Rv) = {ueC|e +1 <du, v) € ed j). 


Then, we shall say that C is uniformly packed of jth order with parameters 
A, p, if 

VveD, [RC] =p 
and 

VveT;, |R| 2 4 


hold. The reader may verify that, for j — 1, this definition coincides with the one 
given for uniformly packed quasi-perfect codes. Many properties of these latter 
codes can be generalized to jth-order uniformly packed codes, with j > 2. In 
particular we have the following generalization of theorem 11. 
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Theorem 13 


An e-error-correcting code C is uniformly packed of jth order with param- 
eters A, u iff, as an element in C(V), it satisfies 


(u S.—A (S, E Sa—e-j—1) + Seg Pd Se) + C= la Sp (34) 
Proof. It suffices to observe that, with D defined as in (23), we have by definition 
(Ser — )*C— B D- A (Se — Sa-e-j-1) * C, 


from which the theorem easily follows. 


Corollary 13.1. (i) Let C be a jth-order uniformly packed e-error-correcting 
code with parameters A, u, and minimum distance d, and let F(x) be the poly- 
nomial of degree e + j defined by the Krawtchouk expansion Y^ «, P,(x) with 
coefficients 


a = l/u fo e+1i<i<e+j, 

a=1—Afu for d—e—j<i<e, (35) 
and 

a =1 for 0<i<d—e—j-—l. 


Then, F(x) is divisible by the characteristic polynomial of C, and 
F(0) = q" |C|"*. 


(ii) Conversely, if F(x), defined by the Krawtchouk expansion with coef- 
ficients (35), is the characteristic polynomial of an e-error-correcting code C, 
then C is uniformly packed of jth order with parameters 4, u. 


The above corollary is a generalization of theorem 12, and can be proved 
in a similar way. We shall omit the proof. The following corollaries are general- 
izations of corollaries 12.2 and 12.3 and are proved similarly. We shall also omit 
their proofs. 


Corollary 13.2. A jth-order uniformly packed e-error-correcting code C is t- 
regular, with t = d(C) — e— j. 


Corollary 13.3. In a q-ary jth-order uniformly packed e-error-correcting code 
containing the all-zero vector, the code vectors of weight k define, for any k, 
a q-ary t-design, where t = e + 1— j ifd=2e+1, or t=e+2—j if 
d= 2e + 2. ; 


We conclude this section with a brief description of some jth-order uniformly 
packed codes which are known to us. The results are summarized in table III, 
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TABLE III 
Some jth-order uniformly packed codes 


1 A=ptl 2 (229 — 1) Q2n-2 — 1)/15 
2 2| 22» —1 [Q27*1—17) (22m-2_1)/15] 2 (22"» —1) Q2"-2 — 1/15 
3 17 10 15 
4 8 9 
5 6 7 
6 45 55 


where, for each code, or family of codes, referred to by a number in the first 
column, we list the parameter q, referring to the field GF(g), the minimum 
distance d, the order j of uniform packing, the length n, and the parameters A 
and u. The codes of family No. 1 are the BCH binary codes of length 
n = 2?"*! —], and designed distance 7, which, for any m > 2, are easily 
shown to be uniformly packed by application of corollary 13.1 (ii). The codes 
of family No. 2 exist for any m > 2. For m = 2, it is the BCH code of length 
15 and minimum distance 7; for m > 3, they are the nonlinear codes recently 
discovered by Goethals ?). Codes No. 3 and No. 4 are binary quadratic residue 
codes *), and codes No. 5 and No. 6 are punctured codes of the self-dual sym- 
metry codes of lengths 24 and 36, respectively, discovered by Vera Pless 1°). 
If a self-dual (72, 36) binary code with minimum distance 16 would exist 22), 
then its punctured code of length 71 and minimum distance 15 would be a 
4th-order uniformly packed code with parameters 4 = 78 and g = 91. Finally, 
we should like to mention that we found the following set of parameters: 


n= (2-1 + e) (2"— e€), with &--1 or —l, 

n 2m-3 P a + €) (22m2 == 1)/3, 

A= p — 2 (2?™"-4 — 1)/3 — e 2-2, 
for which, with m > 3, binary second-order uniformly packed codes of length 
n and minimum distance 5 may exist. However, we do not know whether codes 
with these parameters actually do exist. 
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Appendix A 


We shall prove the nonexistence of uniformly packed quasi-perfect 1-error- 
correcting codes with parameters 


q = 2, n = (22" — 13, 4 = 0, u = (n— 1y4, m z 3. 


Without loss of generality, we may assume that the code contains the all-zero 
vector, and since, by definition, its external distance is 2, we know from 
theorem 6 that its weight enumerator 


Wz) = Y, ez 


is uniquely determined from cy = 1, c, = 0. By straightforward calculations, 
we obtain 


We(z) = |c|-* (a + 2^ + Ni. —2)1 (1+ 277^: + NL 0 —2)2 (1 2775), 
where |C| = 22", jı = (n — 1/2, ja = (n + 1/4, 


N, =n (3n + 1)/(n + 3), Na = 8n/( + 3). (A.1) 


By corollary 12.3, we know that the code vectors of weight 4 form a 2-design 
S,(2, 4, n), with 2 = (n — 5)/4; the parameter r = (n — 1) 4/3 of this design 
has to be an integer, which implies n 4 0 (mod 3), whence g.c.d. (n, n + 3) = 1. 
Since the weight enumerator 


E ef Welz) = > Ci z! 


must have integer coefficients c;, it follows that the coefficients of the poly- 
nomial 


(Ni/Nz) (1 — z: (1. + z7^ + (= zy (0 + 27772. (A.2) 
have to be divisible by (n -+ 3)/8. Let p be a prime dividing (n + 3)/8, and let 
(n + 3)/8 — qp*, with q 250 (mod p). 


We shall show that the coefficient of z? in (A.2) is not divisible by p*, which 
will prove the result. By using the values given in (A.1), we observe that, 
modulo p*, (A.2) reduces to 


[1 — 2) (1 — 2) P7! [(1 — ze» — (1 + z^], 


which is divisible by z?, since 


2 a 
( 17") = 0 (moa for i—1,2,...,p—1, 
i 
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and where the coefficient of z? is 
2 (^4 
—2 ( i ) 0 (mod p*). . 
P 


Hence, the result is proved. 


Appendix B 


We shall show that the only binary uniformly packed quasi-perfect 2-error- 
correcting codes with parameters À and y satisfying 4 + 1 = 4, are those 
numbered 1, 2 and 3 in table II. By theorem 12, we know that the characteristic 
polynomial F,(x) of such a code has the Krawtchouk expansion 


Fe(x) = Po + Pix) + (1/9) Pa) + (0/9) Pa), 
that is, 
6u Fo(x) = (n + 1 — 2x) ((n + 1 — 2x)? — Gn + 1 — 6y)}. 


By corollary 12.1, the zeros of this polynomial have to be distinct integers in the 
interval [1, n]. This is equivalent to the conditions that n is odd and 3n -+ 1 — 6u 
is the square of an integer, v, say, so that we have 


3n + 1— 6u = v?, (A.3) 


where v is even since n is odd. 
From the condition |c] Fc(0) = 2", we deduce ZE 


OLE n S a EA Caa a G E GS) 


Let n +- 1 = pa 2”, n41 +v =p, n+1—v=p,2%, v= pm 2”, and ` 
\c| = p, 2”, when all the p are odd integers. Substitution in (A.4) yields 


pa (3 . 2^** — p, p, p, 2*******) = (2 (v[2)? + 1) 2"*?, 


from which it follows, since a > 1, that 


either a=1 and ytatb+e>n+l, 
(A.5) 
or a>l and ytatb+c=n+1. 


In the first case, the only possibility isytatbt+tc=n+2, pppe— 1, 
andn + 1 = 2p, = v? + 2, which implies 3u = n + 1, contradicting uw < n/3, 
cf. (28). Hence, (A.5) holds, and from (A.4) we deduce 


Py Pa Po Pe = 3H = 4 (3n + 1— v?). (A.6) 


34 J. M. GOETHALS and H. C. A. van TILBORG 


From the definition of the p, it follows that we have 
2^ p, = 2° p, + 2" Dn» 2° p, = 2° Da TE 2" pn» 


from which we deduce that one of the following holds: 


(1) a<m, a—b-c, 

(ii) a >m, b=c=m, 

(iii) a=m, andeithr b=a+1,c>a+l, 
or ec=at+1,b>a+l. 


We shall consider the three cases separately. We first observe that, for any 
n > 3, the following inequality holds: 


2 (3n 4- 1— v2) »? < 5((n + 1)? — v?). (A.7) 


(i) In the first case, a = b = c < m, we have p,/p, > 2 (n + 1 +h, 
PelPm È 2 (n + 1—v)/v and from (A.6) we deduce 


v? (3n + 1— v?) D8 p, pa pu? (n + 1)? — v?) 
which, by use of (A.7), implies 
Py Pa Pw? < 5/16, 


which clearly contradicts our assumption on the p. 
(ii) In the second case, b = c = m < a, we have 


PolPm = (n + 1 + v)/v, DelPm = (n + 1 — v)lv, 
and from (A.6) we deduce 
v? (3n + 1— v?) = 2 py Pa py? (n + 1? — v?), 
which by use of (A.T), implies p, Pa Pm? < 5/4, whence p, = p, = Pm = 1, and 
(v? — (n + 1) (? —2 (n - 1) — 0. 


The solution v? = n + 1 yields n = 2" — 1, u = n/3, that is the parameters 
of family No. 1 in table II. The solution v? = 2 (n + 1) yields n = 2?"-1 — 1, 
u = (n — 1/6, that is the parameters of family No. 2 in table II. 

(ii) In the third case, a — m, we have either b>c=a+1, or 
c >b =a + 1. Let us assume c = a + 1, in which case we have 


PelDm = i (n epus v)/v, PalPm = (n + 1/2, 
and from (A.6) we deduce 
v? (3n + 1— v?) = p, py Pm? (n + 1) (n + 1— 9). (A.8) 
Since v? (3n + 1 — v?) < (3 (n + 1)/2)?, we deduce 
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9 (n 4- 1) 


« AG) ; (A.9) 


Py Po Pm 


and since yz is odd and not equal to 1, we have v? < 3n + 1 — 18, by (A.6), 
whence (n + 1)/(n + 1 — v) < 4/3 for n > 40. Hence, for n > 40, we have 
by (A.9), p, py Pm? < 3, that is p, py Pm? = 1, v = 2", p, = 2°-™—1; and 
from (A.8) we deduce 


Pa (pa — 1) + 2 = 2" (3p, — 2"), 


which has no solution except for m = 3, p, = 3, n = 23, u = 1, which cor- 
responds to the Golay code and has to be excluded. For n < 40, one can 
check by hand that the only solution corresponds to the case n = 11, u = 3, 
that is, No. 3 in table II. The assumption b = a + 1 can be treated in a similar 
way and readily leads to a contradiction. 


Remark. By similar methods, one can show that the only binary uniformly 
packed quasi-perfect single-error-correcting codes with parameters A and u 
satisfying A + 1 = p are the following families appearing in table I: 

(i) family No. 1, with q = 2, r = 2"-1 + 1, m È 2, 

(ii) family No. 2, with q = 2, r = 2"-71 — 1, m È 3, 

Gii) family No. 18, with g = 2, m z 3. 
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CONTRIBUTION TO THE STUDY OF ELECTRO- 
HYDRODYNAMIC INSTABILITIES IN A NEMATIC 
ABOVE A SMECTIC-4 OR -CINEMATIC TRANSITION 


by M. GOSCIANSKI 


Abstract 


Observations are given of electrohydrodynamic instabilities in nematic 
materials of negative dielectric anisotropy e, for which the anisotropy 
of conductivity c, decreases continuously from positive to negative 
values with decreasing temperature. Both classical regimes, the dielectric 
and conduction ones, do not appear under a critical temperature T* 
as predicted by the theoretical models. At temperatures lower than T'*, 
two new regimes of instabilities are observed; the low-frequency one 
may be interpreted in terms of amplified twist fluctuations. A twist 
distortion can be amplified for positive values of the friction coefficient 
&3 and for negative values of c,. In addition, some a.c.-field effects on 
smectic-4 and -C phases are reported; two instabilities regimes are 
Observed in the smectic-C phase. 


1. Introduction 


Periodic distortions in the orientation of a nematic subjected to the action 
of an alternating electric field have been extensively studied both from the 
theoretical and from the experimental point of view ':?:3). Two regimes of 
instability have been shown to exist, depending on the values of the excitation 
frequency in relation to the charge relaxation frequency 1/7: a low-frequency 
regime that shows “Williams domains" and a high-frequency regime charac- 
terized by a “chevron” texture. 

A mechanism based on the amplification of a bend distortion due to the 
accumulation of space charge accounts for these instabilities. Using the equa- 
tions describing the mechanical and electrical behaviour of a nematic subjected 
to an electric field, Dubois-Violette, De Gennes and Parodi (D.V.G.P.) ?) have 
taken up an earlier suggestion due to Carr ^) and Helfrich *) to show that the 
instability mechanism can be described by two coupled differential equations 
for the space-charge density q(t) and the local curvature of the molecular 
alignment y(t). i 

These are the parametric equations 


å + qlt + ony =0, 
V + v[T, + Eq =0 
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where E(w) is the amplitude of the applied field at frequency c, 
£j, €1, Cip O, are dielectric constants and electrical conductivities meas- 
ured parallel and perpendicular to the molecules respectively, 
T, is the relaxation time of the curvature of bending, 
Oy is an apparent conductivity defined by 


EL OL 
Cu —0yjy|—— — 
. . . . E | di ! 
7] is an effective viscosity. 
This system of equations has two types of solution depending on the values of w: 


(a) Conduction regime 
When wt S 1 and o T; » 1, the threshold field for the instability in the 
conduction regime is given by 
Eg? (1 + w? 7?) 
E2 — (1 + o? 1?) d 


where £? is the Helfrich parameter defined by 


pet (1-24) (ctu) 2 
An 9, Ey Eq 
Here ô is a ratio of the coefficients of viscosity of the nematic, 2 and Es? are 
the coefficients that appear in the expression for the relaxation time of the 
curvature. 

Equation (1) gives the limit of validity of the conduction regime: 


(e = 1)1/2 


oO, —————. (3) 
T 


E*(o) = (1) 


The first experimental observations have indicated that the value for the 
wave vector of the instability should be taken as k = z/d (d = thickness of 
the sample), which implies a threshold voltage 


1+ (7 — 1) w/w? i 
Xw) = V2(0) 1E 
OVO ars © 
with 


ey Km? 1 


V*) = G) 


Ej Ea £o Ċ?— 1 f 
The equations (4) and (5) give the existence condition for instabilities in the 
conduction regime: ¢? > 1. 
The validity of expressions (1) and (4) requires that œ T; `> 1. This con- 
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dition is only satisfied if the thickness of the sample is much greater than a 


critical value 
K; T 1/2 
d,=% (=) 
7] 


(K; is the elastic modulus for bending). 


(b) Dielectric regime 

At frequencies for which T,(w) « v the curvature follows the excitation; 
it is therefore reasonable (and this is confirmed by calculation) to consider that 
w T, ~ 1. The charges are then stationary. This is the dielectric regime. Cal- 
culation predicts that the wave vector of this instability is proportional to «!/? 
and independent of the thickness of the sample, and similarly that there is a 
threshold field proportional to w1/?. The equation characterizing the dielectric 
regime is only satisfied for values of the parameter ¢? greater than unity. The 
condition ¢? > 1, necessary for establishing the two regimes of bending in- 
stability, can be written as 


ON ĉa \ En 
2-- ja. (6) 
9, £j Ó/ E 


For any value of 6 this condition is no longer satisfied if oj, < 0, or: 


on En 
—— e. (7) 
OL €, 
We are interested in two nematics *): 
n-(p-butoxybenzylidene)-p-n-octyl aniline (40.8), which gives the following phase 
diagram: 


crystal z——— smectic B —— smectic A —— nematic —— liquid, 
31°1°C 48-6°C 62:5?C 77.8*C 


and the 4-4’ octyloxyazoxybenzene (C): 


solid == smectic C => nematic —— liquid. 
79*C 107°C 125°C 


In fact, because there is a smectic preordering 9), they have a negative con- 
duction anisotropy in a domain of several degrees above the smectic/nematic 
transition (fig. 1). The condition (6) can then no longer be satisfied since 
1 « ejen < 1-1. 

The study of electrohydrodynamic instabilities in these compounds as a 
function of temperature gives a test of the validity of the Carr-Helfrich 


*) These two products were synthesized by Messrs Liebert and Stzrelecki of the Faculty 
des Sciences, Orsay. 
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D eer 70 75 pe 


——» Temperature (°C) 


Fig. 1. Variation of the ratio of the parallel and perpendicular electrical conductivities in 
the nematic phase of 40.8; ory; = 3.107? (Q cm)^!. 


mechanism: disappearance of the two types of instabilities described above 
when £? < 1 (ie. at temperatures T < T*), as well as a divergence of the 
threshold voltage given by (4) when Z£? — 1. On the other hand, the study 
of nematics showing pretransitional effects gives a method of finding out 
whether the system remains stable when ¢? < 1, or whether another kind of 
instability can develop; it also gives an introduction to studies of the effects 
of fields in smectics. 


2. Experimental conditions 


The observation cell consists of a thin film of liquid crystal contained be- 
tween two flat parallel conducting plates separated by insulating mylar spacers. 
We made our observations on samples for which the molecules were aligned 
parallel to the walls in the direction ng by evaporating a thin film of silicon 
monoxide (400 A) on to the plates at an angle incidence of about 80° 7). 
Undesired charge injection at the electrodes is suppressed by coating the elec- 
trodes with a layer of 5000 À of SiO at normal incidence before depositing 
the SiO at oblique incidence. 

The oven used has been described in detail elsewhere 9): the horizontal gra- 
dient in the measurement cell is of the order of 1071 °C/mm. The temperature 
is measured with an uncertainty less than 2 . 107! °C. The dielectric constants 
and the electrical conductivities were measured through the entire nematic 
phase of each of the two compounds °), while we measured the conductivity o, 
on our cell to take into account any deterioration of the sample. 
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The deformations induced in a planar sample by an electric field have been 
observed either with the microscope or by diffraction. 


3. Regime in which ¢? > 1: bend instabilities 


3.1. Observations 


We have four parameters: field, frequency, temperature and thickness. We 
have distinguished two types of regime, depending on the frequency of excita- 
tion: a dielectric regime at high frequency and a conduction regime at low 
frequency, which have different characteristics depending on the thickness and 
temperature. We shall treat the simpler case of the dielectric regime first. 


3.1.1. Dielectric regime 


At high frequency we have observed, close to the nematic/isotropic transi- 
tion Ty;, instabilities that show all the characteristics of the dielectric regime: 
— The threshold of visibility is a threshold field. 

— The deformation is unidimensional; beyond the threshold the instabilities 
show the “chevron” texture. 

— The action of a stabilizing electric field (1 kV/cm at f= 5 kHz) reduces 
the wave vector of the instability but does not affect its visibility threshold. 

— The intensity of the diffracted light oscillates at a frequency twice that of 
the excitation. 

— The square of the wave vector first increases quasi-linearly with c, then 
varies more slowly up to a maximum value (wy œ% 1 kHz) beyond which 
k? slowly decreases. 

Below a critical thickness d, (d, 20 um) the dielectric regime appears at 
all frequencies. For samples of thickness d > d, a limiting frequency w,’ exists 
below which the regime does not appear. This frequency w,’ increases with 
thickness up to a maximum value, which we have obtained for samples of 
thickness greater than 100 um. 

When the temperature decreases, this type of instability no longer appears 
below a temperature T*, which is characteristic of the material. The visibility 
thresholds increase strongly and w.’ — 0 when T — T'* (fig. 2). 


3.1.2. Conduction regime 


We have seen that for small thicknesses (d < 20 um), the dielectric regime 
extends to all frequencies. For samples of thickness d greater than the critical 
thickness d, we have distinguished two regimes depending on the value of 
the ratio d/d,. i 


. (a) did, 1 
With continuous excitation and at low frequency (0, w,’), the characteristics 
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——> Threshold voltage (V) 


0 100 200 300 
——> Frequency (Hz) 


Fig. 2. Variation of the threshold characteristics with temperature; 40.8, d = 60 um. 


of the instabilities are identical to those of the conduction regime in a con- 

ventional nematic: 

— the Williams striations’) appear at a threshold voltage that increases 
sharply when w — ws’; 

— the periodicity of the striations is of the same order of magnitude as the 
thickness of the sample; 

— the action of a stabilizing electric field increases the visibility threshold of 
the instabilities without changing their periodicity ; 

— the time dependence of the diffracted intensity shows that the curvature is 
quasi-stationary. 

Two important modifications arise when the temperature of the sample 
decreases: 

— the visibility thresholds increase; 
— the limiting frequency w,’ between the conduction and dielectric regimes 
decreases (fig. 2). 

On approaching the temperature 7'*, the visibility thresholds of the instabil- 
ities increase sharply whereas the frequency w,’ tends to zero (figs 2 and 5). 
At temperatures below the temperature T* at which the dielectric regime dis- 
appears, the conduction regime no longer exists. 

Whatever the temperature, the period of the Williams striations /1 is always 
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Fig. 3. Temperature dependence of the period of the Williams striations for several normalized 
frequencies; 40.8, d = 110 um. 


less than the thickness d of the sample. The ratio A/d also becomes smaller as 
the thickness is increased. When the excitation frequency increases, the period 
decreases (fig. 3). 

Reduction of the temperature has a similar effect on the periodicity of the 
instability. At a fixed normalized frequency w/w,’, the period decreases very 
rapidly near the critical temperature T* (fig. 3). 


(b) dd. z; 1 


For samples of typical thickness d ~ 50 um the instabilities that we observe 
at low frequencies (0, w,’) have the properties of instabilities in the conduction 
regime, but they no longer develop uniformly along the direction normal to no. 
The first modification that appears with decreasing temperature is the inclina- 
tion of the focal lines at an acute angle on both sides of the direction normal 
to no. If we continue to lower the temperature, the instability takes on the 
appearance of a two-dimensional network of hexagonal cells in which the 
branches normal to n, can link up at an angle (fig. 4). The decrease in the 
dimension of tbe cells in this direction brings about an increase in the apparent 
angle made by the focal lines with the normal no. The focal lines are not visible 
in light polarized perpendicular to no: the molecules lie in the plane (E, no). 
It can be seen that particles of dust suspended in the fluid describe regular 
cyclic paths: they travel from one electrode to the other in trajectories per- 
pendicular to the focal lines. 

This two-dimensional development of the instabilities of the conduction 
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Fig. 4. Modification of the Williams domains: hexagonal cells and focal lines at an obtuse 
angle; Cg, d — 60 um, T — 117?C, 50x. 


regime is very important at low frequencies and at small sample thicknesses. 
It decreases and vanishes when o/c,' — 1 and for thicknesses above 100 um. 
At small thicknesses (d ~ 30 um) we can observe an angle 0 in the whole of 
the temperature domain in which the instability exists. This angle is small near 
to the isotropic phase and it increases when the temperature decreases down 
to T *. Regardless of the thickness of the sample or its temperature, the angle 
between the focal lines and the normal n, decreases when the excitation fre- 
quency increases. It is always zero when c/o, — 1. The two-dimensional de- 
formation exists in samples of 40.8 as in those of Cg; however, the effect is 
more marked in the latter. 

The variation of the threshold characteristics of these instabilities with tem- 
perature is identical to that of the Williams domains: the visibility thresholds 
increase sharply and c, — 0 when T — T+. At temperatures T < T* these 
instabilities do not appear. 


3.2. Interpretation of the experimental results 


We have seen that for any value of the sample thickness there exists a charac- 
teristic temperature T+ of the nematic below which neither the conduction 
regime nor the dielectric regime appear. Moreover, when T — T+, the cut-off 
frequency between the two regimes w,’ — 0. 

For samples of 40.8, for which the conduction anisotropy corresponds to 
the values given in fig. 1, the temperature T+ is 65-5 ^C. We have observed 
the disappearance of the electrohydrodynamic bend instabilities in samples of 
Gs at T” = 1165 "C. 

We have compared our observations with the theoretical predictions of 
D.V.G.P. for samples of 40.8 of thickness d > 100 wm, for which the charac- 
teristics of the regimes are still compatible with the model. For these samples 
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it seems reasonable to consider that the measured cut-off frequency w,’ is the 
limiting frequency of validity of the conduction regime w.. 


(a) Determination of T'*, verification of the importance of ¿2 ' 


(2), requires a knowledge of ih anisotropies of the electrical parameters, as 
well as the ratio of the coefficients of viscosity ô. Since this ratio is not known 
for the nematics used, we have tried to find an order of magnitude, assuming 
that the theory is valid close to Tyr. We have measured the cut-off frequency c, 
and also the electrical conductivity c, (c, = o close to Tyr) in our cell. From 
the values of w, and v = £o e/o, we have obtained the values of ¢? by using 
the relation £? — 1 = c? t?. We have found that £? 7 3 close to Ty, for 
40.8 (fig. 1), and assuming that this theory is valid at T = 72 °C, that ¿2 ~ 2 
at this temperature. From the values of the anisotropies of the electrical param- 
eters we have obtained the value of ô by using eq. (2): ô + 0-12 near Ty; and 
ô ~ 0-24 at T= 72°C. Assuming a normal variation of the ratio ô of the 
coefficients of viscosity in the temperature range (T+, Ty), then putting the 
value 6 = 0-5 into the existence condition for the instabilities (6), and taking 
the values of the dielectric constants and electrical conductivities we find that 
the critical temperature for 40.8 is T* ~ 65-8 °C. The value of ô = 0-2 gives 
a critical temperature T* ~ 66-5 °C, and the value 6 = 1, for which the con- 
dition ¢? > 1 becomes c/o, > 1, gives the temperature T* = 65:3 °C. 

These values agree well with the temperature T+ = 65-5 °C, at which we 
have observed the disappearance of the two instability regimes. 


(b) Verification of the decrease in the cut-off frequency w, with temperature 


The variation of the cut-off frequency for a sample of thickness d>> d, 
agrees with the predictions from the model (eq. (3)). It decreases and vanishes 
with ¢?— 1 at the temperature T* = 65-5 °C at which the instabilities dis- 
appear (fig. 5). 


(c) Sharp increase of the thresholds of the instabilities with temperature 


If we have been able to determine experimentally the temperatures at which 
¢? = ], on the other hand, we have not been able to find out the variation of 
¢? with temperature: in effect, we do not know the ratio of the coefficients of 
viscosity 6. We cannot therefore compare our results with the shape predicted 
by eq. (4) for the temperature dependence of the thresholds of instability. : 

From the relation (4) we have verified the validity of the frequency depend- 
ence of the instabilities of the conduction regime as a function of temperature: 
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— —» Frequency (Hz) 
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Taw 65 70 75 Tur 
——» Temperature (°C) 


Fig. 5. Cut-off frequency we between the two bend-instability regimes as a function of tem- 
perature; we > 0 at T = 65:5?C; 40.8, d = 110 um. 


1 + [2(7) — 1] w?/@,? 


VT, w/w.) = V?(1,0 
(T, ojo) = Y, — 


(8) 
with 
ey KT) K?(T, 0) d? 


V*(T, 0) = — ——————. 
) Eo Ea €, (T)—1 


(9) 


Figure 6 shows the variation of the threshold voltages for the appearance of 
the Williams domains as a function of temperature. Since the cut-off frequency 
varies with temperature, it is necessary to consider the variations in threshold 
voltages at the same normalized frequencies w/w,. 

Whereas the wave vector k in the D.V.G.P. model is assumed constant: 
k = 2/d, we have observed a variation of the wave vector of the instability 
(fig. 4): 


uA 


k(T, wlw.) = A(T, ojo.) . 


We can produce no theoretical model that will explain this variation; how- 
ever, a study of the instabilities produced with alternating square-wave excita- 
tion predicts a periodicity for the instability that is less than the sample thick- 
ness and depends on the value of £? +°). 

We have included the temperature and frequency dependences of the wave 
vector K(T, w/w.) in eqs (8) and (9): 
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Fig. 6. Threshold voltage for the appearance of the conduction-regime instabilities as a 
function of temperature for different normalized frequencies. In the inset the curve 1 [V*(T, w) 
as a function of 

1 K*(T, 0) 


V2(T, 0) k2(T, w) 
shows that the theoretical frequency dependence of the threshold is confirmed at each fre- 
quency; 40.8, d — 110 um. 


KT, co) I + (¢? i 1) w? w? 
k*(T, 0) 1— o?[oj? l 
From the evaluation of 6 and v, and also from the measurement of We, We 

deduce that ¢?— 1 « 1 in the temperature domain where the thresholds 

increase sharply (T < 70°C). This approximation permits us to neglect the 
second term in the numerator of (10). The inset in fig. 6 shows 1/V*(T, co.) 
as a function of 


VT, c/o.) = V*(T, 0) (10) 


1 k*(T, 0) 
V*(T, 0) k?(T, ojo) 


We find the values of 1— w?/w,? for the different normalized frequencies 
from the slope of the curves. The frequency dependence predicted by the 
D.V.G.P. theory is therefore confirmed throughout the temperature domain 
(T*, Tyr) provided that the variation of the wave vector of the instability is taken 
into account. 
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4. 


The ¢? < 1 regime 


4.1. Experimental observations 


of 


At temperatures T — T* we have observed, in accordance with the values 
the excitation frequency, two new instability regimes that can essentially be 


distinguished by the values of the wave vector k of the distortion: a low-fre- 
quency regime (w < «,) with k ~ l/d, and a high-frequency regime (w > o) 
with 1/k ~ a few microns. 


4.1.1. Low-frequency regime 


This regime closely resembles the low-frequency regime of bend instability 


characterized by the Williams domains, but with several important differences: 
— The focal lines are parallel to the direction of orientation no of the molecules. 


At temperatures close to 7* they are slightly inflected on both sides of no, 
while closer to the smectic phase they are straight lines (fig. 7). In light 
polarized at right angles to n, the focal lines disappear: the molecules remain 
in the plane (E, no). 


— The instability is characterized by a threshold field, as shown in fig. 8. 
— There is hydrodynamic motion in the fluid: the observation of suspended 


particles of dust in the nematic shows that there are two types of motion, 
a cyclic motion normal to the focal lines and confined between them, and 
a slower motion parallel to them. 


— The time variation of the intensity of the light diffracted by the deformed 


sample follows the oscillations of the curvature for all the thicknesses of 
sample that we have investigated (30 ym < d < 120 um). 


— A high-frequency stabilizing field (several kV/cm at 5 kHz) increases the 


threshold field for the appearance of the instability, but does not modify 
the periodicity of the distortion. 


Fig. 7. Low-frequency instabilities for ¢2 < 1. The domains are on both sides of the direc- 
tion no of alignment of the molecules; Cg, d = 30 um, T = 115 "s 
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Fig. 8. Voltage threshold for the appearance of focal lines parallel to no for different sample 
thicknesses. 


4.1.2. High-frequency regime 


Above a limiting frequency œ, we observe a regime that has characteristics 
very similar to those of the ordinary dielectric regime, but with one exception: 
the arrangement of the focal lines is two-dimensional (fig. 9). But as in the 
dielectric regime: 


Fig. 9. Two-dimensional aspect of the high-frequency instabilities for £? < 1; 40.8, d= 
30 um, T = 75?C, 200 x. 
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— The periodicities are independent of the thickness of the sample and are of 
the order of several microns. 

— The focal lines disappear in light polarized at right angles to no. 

— The instability is characterized by a threshold field that increases with fre- 
quency. If the temperature decreases, the threshold field increases. 

— An analysis of the time variation of the diffracted light shows curvature 
oscillations: the intensity oscillates at twice the excitation frequency. 

— A. stabilizing field decreases the wave vectors of the instability without 
changing the value of the threshold field. 


4.2. Proposed “mechanism” for the low-frequency instability 


The optical characteristics of the low-frequency regime can only be explained 
by the existence of a periodic-twist distortion with a wave vector perpendicular 
to ng. Such a distortion would indeed be expressed by focals parallel to no, 
which disappear for polarization perpendicular to no, since the director always 
remains in the plane (E, nọ). To take account of this instability, we have to 
think of a mechanism that will amplify twist fluctuations of the molecular 
orientation. 

Figure 10 gives a highly simplified description of such a mechanism. Let us 
consider a twist fluctuation in the orientation of the director in a nematic with 
a negative conduction ansiotropy (c, > oj). In the distorted zones of orien- 
tation the current lines are inflected, the charges prefer to flow at right angles 


E| 4 X 
Fluid velocity 
Fig. 10. Proposed mechanism for the amplification of a twist distortion. Because of the con- 


ductivity anisotropy o, > o the current lines are inflected. The resulting fluid motion pro- 
duces a destabilizing couple if «3 is positive. 


to the molecules. If we assume the existence of a motion of the fluid parallel 
to the current lines, a flow component parallel to nọ and a velocity gradient 
perpendicular to ng appear in these regions. The viscous couple exerted on 
the molecules by this flow is characterized by the friction coefficient «3 1+). 
If œ, is positive, this couple destabilizes, and the fluctuation in the orientation 
of the director can be amplified. In a conventional nematic «3 +?) is negative 
which implies in particular that the molecules have a stable angle of orien- 
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tation in a flow parallel to ng. Recently, however, the existence of nematics 
for which «3 is positive has been demonstrated !?:1*), and it has been predicted 
that, above a nematic/smectic transition, fluctuations in smectic order could 
imply a change in the sign of a3, and even a sharp positive variation of this 
coefficient of friction !^). Using apparatus developed by Pieranski and 
Guyon !?), we have shown that «3 was positive in the entire nematic phase 
of 40.8 (at least up to temperatures in the region of Tyr), and also in the entire 
nematic phase of Cg. The mechanism that we have just described can therefore 
exist in the whole of the temperature range where o, is negative and a3 is 
positive. 

This mechanism only allows us to show that it is reasonable to assume the 
existence of an instability associated with a twist distortion. It is compatible 
with the existence of a hydrodynamic motion parallel to the focal lines, but 
detailed calculations are necessary to give an explicit expression for the thresh- 
old field. If we try to adapt the D.V.G.P. calculations, the problem of cou- 
pling between the space charge and the curvature of the distortions remains in 
its entirety, and can undoubtedly only be resolved by a full calculation in three 
dimensions, since it is likely that the distortion is not a pure twist. In this 
situation we cannot as yet interpret the high-frequency instability. 


4.3. Re-examination of the two-dimensional development of the bend instabilities 
in the conduction regime 


We have seen that for T > T*, the instabilities of the conduction regime, 
while retaining a behaviour consistent with the predictions from the D.V.G.P. 
model as far as the temperature variations of the threshold, ¢? and «c, are 
concerned, can appear in the form of focal lines making an angle @ with the 
normal to n, (fig. 4). 

The one-dimensional mechanism proposed by D.V.G.P. does not take this 
modification into account with regard to conventional nematics. 

Let us consider the fluid motion that becomes established above the thresh- 
old of appearance of the Williams domains. When “boundary effects" are 
taken into account, regions exist in which the flow is parallel to the molecules 
and the velocity gradient is perpendicular to them. In these regions the mole- 
cules are subject to a viscous couple characterized by a3; for positive «4 the 
fluctuations of twist can be amplified. The instability no longer develops uni- 
formly in the direction normal to no: a twist distortion reappears to break up 
the roll into several parts some of which become unwedged in relation to the 
others. 

The angle @ varies with the excitation frequency and the sample thickness 
in the same sense as the velocity of the fluid, which according to Grüler 15) 
is proportional to the wave vector of the deformation k =œ a/d, and decreases 
as the excitation frequency increases. We have seen that the "angle effect" was 
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more marked for thin samples (d ~ 30 um) and that it diminished on increas- 
ing the frequency (0 — 0 when w/w, — 1). 


5. Some observations relating to the action of an electric field on a smectic-A 
or -C phase 


Unlike the effects that are observed in the nematic phase, which are identical 
whatever the type of smectic phase preceding the nematic phase, the effect of 
the electrical field depends on whether the preceding smectic phase was of 
type A or C. 


5.1. Electrohydrodynamic instabilities in a smectic-C phase 


If a nematic single-domain nematic sample of C, is slowly cooled to just 
below the temperature of the smectic- C/nematic transition, a sample of smectic 
C is obtained that includes large domains of uniform orientation separated by 
disinclinations. As far as the effects of the field are concerned, the domains are 
large enough for each one to be considered as an oriented smectic sample. The 
area of the domains of the same planar configuration is of the order of 1 mm?. 

Depending on the frequency of observation, we have observed two instability 
regimes that can be distinguished by the wave vector k of the distortion: a low- 
frequency regime (w < œw.) with k œ l/d and k perpendicular to ng, and a 
high-frequency regime (w > w.) with 1/k of the order of several microns and 
k parallel to no. 


5.1.1. Low-frequency regime 


The appearance of focal lines more or less parallel to nọ seems to be related 
to a threshold field (fig. 11); it is preceded by a hydrodynamic motion made 
visible by the cyclical motion of particles of dust. This threshold of visibility 
is high, of the order of 20 kV/cm. The threshold fields and the periodicity of 
the instabilities just above the smectic/nematic transition are continuous with 
the instabilities of the low-frequency regime in the nematic phase. The thresh- 
olds of visibility of these instabilities increase with decreasing temperature. 


Fig. 11. Low-frequency instabilities in the smectic-C phase of Cg; d — 60 um, T = 106°C, 
40 x. 
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5.1.2. High-frequency regime 


Above a frequency w, a pattern of fine lines appears at the threshold field, 
which are oriented either perpendicular to n, or slightly inclined on both sides 
of the normal to n, (fig. 12). The periodicity of these focal lines is of the order 


Fig. 12. High-frequency instability regime in the smectic-C phase of Cg; d = 60 um, T = 
106 °C, 200 x. 


of several microns, and is independent of the thickness in the range investigated 
(30 um < d < 120 um). The intensity of the light diffracted by the deformed 
medium oscillates at twice the excitation frequency. This regime appears to have 
been observed as early as 1968 +6). 


5.2. Electrohydrodynamic instabilities in a smectic-A phase 


In single-domain samples of smectic-A (40.8) there is no combination of 
voltage, excitation frequency and polarization of the light that produces de 
formation. Only the motion of dust particles shows that there is a roll-type 
hydrodynamic motion. The amplitude of these cycles decreases when the tem- 
perature decreases, and the hydrodynamic threshold increases. In planar con- 
figuration the smectic planes are perpendicular to no; the flow patterns do not 
involve the appearance of any focal lines because of constraints that keep the 
molecules normal to the smectic planes. 


6. Conclusion 


The main purpose of this investigation was to analyse the effect of fluctua- 
tions in smectic order on the development of electrohydrodynamic deformations 
in nematic liquid crystals. 

The materials that we used permitted us to vary the value of the Helfrich param- 
eter £? continuously asa function of the temperature from values greater than 1 to 
values less than 1. We have verified that the electrohydrodynamic behaviour of 
liquid crystals depends very strongly on the sign of the parameter 2? — 1. 

Foi £? — 1 >Q, we observe, as in conventional nematics, the development 
of bend instabilities. When £? tends towards 1 from above, the thresholds of 
visibility of the two instability regimes (conduction regime and dielectric regime) 
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increase and the cut-off frequence w, tends to zero. At a critical temperature 7*, 
for which £? = 1, the bend instabilities disappear. In the whole of the tem- 
perature domain (T* < T < Tyr), the threshold can be interpreted very satis- 
factorily in terms of the D.V.G.P. model. ' 

On the other hand, for ¢? — 1 < 0 the bend instabilities disappear, and we 
then observe two new kinds of instabilities. We have proposed a simple but 
comprehensive interpretation that gives an account of the low-frequency regime: 
a twist distortion can be amplified if the friction coefficient «4 is positive and 
if the ratio of the electrical conductivities parallel and perpendicular to the 
molecules is less than unity. Detailed calculations involving a model in several 
dimensions — since it is likely that the deformation is not a pure twist — are 
necessary to give the threshold field and the two regimes explicitly. 

Although we have been able to separate and classify the different types of 
instabilities observed, the transitions between the regimes have not been so 
clearly established. A. detailed study of the instabilities at temperatures close 
to T* would permit the determination of the existence conditions of the new 
regimes and would facilitate the derivation of a mechanism that gives an 
account of these instabilities. 

Finally, the preliminary observations that we have been able to make on 
smectic-C phases have shown that the characteristics of electrohydrodynamic 
instabilities are very close to those that can be obtained in the nematic phase 
at temperatures slightly higher than Tsy. This leads to the conclusion that the 
experiments performed in the nematic phase for T > Tgy contain much 
information on the behaviour expected from the smectic-C phase: this is all 
the more interesting since proper control of the techniques for preparing single- 
domain layers of smectic-C has not as yet been achieved. 


Laboratoire d' Electronique et Limeil-Brévannes, December 1974 
de Physique Appliquée 
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FIELD AND POTENTIAL DISTRIBUTIONS 
IN CHARGE-TRANSFER DEVICES 


. by H. W. HANNEMAN and L. J. M. ESSER 


Abstract 


A method is proposed by which the two-dimensional potential and field 
distributions in charge-transfer devices can be analytically evaluated, 
taking into account the insulating silicon-dioxide layer. The analytical 
approach includes calculating the effect of (non-mobile) space charge. 
The influence of mobile carriers (charge packets) is assessed qualitatively, 
where possible. The charge-handling capacity and the speed will be 
estimated. Two examples concerning bulk charge-coupled devices dem- 
onstrate the usefulness of our method. An interesting feature is that 
periodicity of the charge-transfer-device structure is not required at all 
for our approach. We will show that non-periodic structures like input/ 
sampling circuits can be analysed readily. 


1. Introduction 


Mobile charges in charge-coupled devices !) move because of thermal diffu- 
sion, a charge gradient and externally induced drift. The latter two mechanisms 
induce electrical fields: the self-induced field and the externally induced field 
(or fringing field) respectively. The self-induced field plays an important role 
during the first part of charge transfer because of the large charge gradients 
then existing. The speed of the device is mainly defined by the time it takes 
to transfer the last vestige of the charge packet. This is largely under the 
influence of the externally induced field. 

The magnitude of the externally induced field has been calculated by several 
authors ?:?:*), Carnes et al.?) have shown with an approximate analysis and 
a computer calculation that fringing fields can be large enough to speed up 
the free-charge-transfer process appreciably. They assumed zero space charge 
in the silicon. McKenna and Schrijer ?) presented a two-dimensional analysis 
based upon a depletion-layer approximation of both surface and buried-channel 
CCD’s and assuming a non-mobile space-charge distribution inside the semi- 
conductor material. Collet and Vliegenthart *) used a similar analysis for the 
PCCD 6) and investigated furthermore the influence of mobile majority car- 
riers (charge packet) upon the externally induced field. The authors ?:?:^) use 
Fourier series to solve the (linearized) equations. 

The assumption that charge packets are not included is quite reasonable if 
not justified in the case of calculation of the estimated charge-handling capacity 
and speed of the device. It has been shown +°) that the charge-handling capacity 
can be calculated by using only no-mobile-charge information of the potential 
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distribution of the CCD. It is important to know the externally induced field 
at the final stage of transfer (and therefore a relatively small charge packet is 
present) because of its bearing on the speed of the device. 

In this paper we present a static, two-dimensional analysis, particularly suit- 
able for bulk-layer charge-coupled devices (buried-channel CCD and PCCD). 
We assume a non-zero, non-mobile space-charge distribution inside the semi- 
conductor. We assume furthermore that there are no mobile minority carriers 
(charge packets) and that the inter-electrode distance is zero. Estimations of the 
maximum charge-handling capacity and the speed of the device are made from 
the calculated potential and field distributions. f 

The mathematical solution presented expresses the potential as a sum of 
potentials of neighbour pairs of electrodes. The solution is exact; no Fourier 
series are used. Non-periodic phenomena such as input sampling and output 
circuitry can be dealt with analytically because the constraint of periodicity of 
for example electrode configuration, electrode voltages and dimensions as 
needed in Fourier series analysis is removed. The computational effort is much 
smaller compared to the Fourier-series approach. Therefore, a (pseudo-)dynamic 
analysis can be provided readily. 

In sec. 2 we present the equations that describe the chosen model of a bulk 
charge-coupled device and discuss the approximations involved. The theory is 
outlined in sec. 3 and the results obtained with our method are discussed in 
sec. 4, using an example of a uniformly doped PCCD, a profiled PCCD and 
an input/sampling circuit. 


2. Basic model and approximations 


A. bulk CCD consists of a layer of semiconductor material (silicon) covered 
with a thin layer of insulating material (silicon dioxide). On top of this layer 
a row of conducting electrodes is deposited. The device is shown schematically 
in fig.. 1. The semiconductor material is doped in such a way that the top part 
is for example an n-type and the bottom part a p-type semiconductor. (If the 
dope profile is reversed, signs of the space charge ought to be reversed in the 
appropriate expressions. Evaluation of the potential and field distributions is 


Fig. 1. Cross-section of charge-transfer device. 
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similar and straightforward.) It is desirable to have the inter-electrode gap w 
as small as possible. This can be achieved by an electrode-overlap structure *) 
or sub-micron electrode spacing and we therefore assume that the inter-electrode 
gap is zero (w = 0). As the width of the electrodes is large compared with their 
length L a two-dimensional model will be quite accurate. It is interesting to 
note that we do not need to assume a periodic structure in the x-direction 
because our mathematical analysis does not require this. 

The potential g, in the silicon-dioxide layer (0 < y < yj), p2 in the n-type 
silicon (y, € y < y2) and g, in the p-type silicon (y > y;) satisfy Poisson's 
or Laplace s equation. Therefore, 


V*?g,2—-—, i=1,2,3 (1) 
€i 

where V? denotes the two-dimensional Laplacian, e, the permittivity and o; the 
space-charge distribution in the appropriate region. It is assumed that the space 
charge in the silicon dioxide is zero (p, = 0) and that e, = e,. The potential 
in the neutral part of the p-type substrate is taken to be zero. The boundary 
conditions can be found by specifying continuity of potential and polarization 
at the respective interfaces. Let V,(x) be the electrode potential (x = 1, 2, 3, 4 
for a four-phase device) and H the average depletion depth in the p-type sub- 
strate, then 


g(x, 0) = a(x) > (2) 
09, op 

Pı = 93, & —6— (y=) (3) 
oy oy 
Op. 0g 

95 = P3, Ž = : (y = 3, (4) 
oy oy 

p(x, y2 + H) — 0. (5) 


Bearing in mind the principle of superposition, (1) and the corresponding 
boundary conditions (2) to (5) are partitioned into two parts. Part I (poten- 
tial ;') is specified by the electrode boundary potential V,(x) and does not 
account for any space charge. The potential satisfies Laplace’s equation 


V? g/ =0 (1a) 


together with boundary conditions (2) and (3). Part II, resulting in a poten- 
tial p, takes the space contribution in the semiconductor into account; 
g" satisfies Poisson's equation 


*) The electrode gap is typically 5 % or less of the electrode length with present-day technology. 
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V2 pt =t, (1b) 
€i 
with boundary conditions 


9105 0) = Pav, 


and (3), (4) and (5). The average electrode potential y, is assumed to be zero 
throughout the paper. (If the average gate potential is not zero then its value 
ought to be added to y," and gl.) 

Obviously, y,' + y; is the solution of the problem stated and satisfies (1) 
to (5). In sec. 3 we will present a new, mathematically elegant and practical 
solution for y,' and a solution g; based upon a depletion-layer approxima- 
tion ?:^), 


3. Theory 


We will deal first with g,'. The typical co-planar electrode configuration of 
charge-transfer devices can be decomposed 7) as is shown in fig. 2a for one 
basic stage of a four-phase CCD. The decomposition contains two elementary 
units: a plate at a fixed potential (V,) and the second unit consisting of two 
plates at zero distance apart with one plate at zero potential and the other at a 
fixed positive or negative potential (V, — V, V,— V, or V,— V, re- 
spectively). All plates extend to infinity. The potential of two infinitely large, 
conducting, co-planar plates at zero distance apart (in vacuum) is (except for 
an additive constant) given by ?) 


AV 
= — (n— 6) (6) 
3 


where AV equals the difference between the plate voltages and 0 represents 
the angle (in the plane z — constant) between the plate and the line joining 
the point of interest P(x, y) and the gap between the plates (fig. 2b). Equi- 


IW Vo V V M 0V-M VV, 
ELS M MI ———— OB 
i |— + AV; 
E——————4À —— 

I I 

0 M - MM 0 -V CERA 

| YY 1 — + AA 
! I 

F0 kk y 0 V-V 0 AA 
| V-Va — * 

Ss 


Fig. 2a. Decomposition of four-phase electrode structure. 
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Fig. 26. The potential and field distributions of a basic building block. 


potentials are planes originating at O. Field lines are concentric cylinders centred 
at O. (It is interesting to note that these potential and field distributions are 
the dual of those of a uniform line charge.) 

Generally, charge-coupled devices show a permittivity interface parallel to 
the plates. We can account for it in the following way. Assume such an inter- 
face at y — y, (fig. 3). Then, we can express the potential in the region 
0 <y x y, as the sum of a contribution (the original) located at the origin, 
and an infinite number of contributions (the reflections or images), located at 
(x, y) = (0,2n y3), n= 1, 2, 3, ... . The reflections at the conducting plane 
y = 0 are total; those at the permittivity interface y = y, partial. The reflec- 
tion coefficient r can be shown to be 


£4— € 
p————.. (7) 
£j + & 
Therefore, we have for the potential in the region 0 < y <S yi 
o 
pu 
p= r"| 9 — arctan ————— | + 
N Ey x 
n=0 
2(n+1)d— 
Torre (=- arctan atnan] (8a) 
x 
Equally, we have for y > yı 
© AF Y 2nd, + 
p= ü-Trr (=— arctan AEN), (8b) 
T 8; x 
^ n=0 


A typical potential distribution is shown in fig. 3. In the case of CCD’s 
r œ 0°5. This implies that (8) converges rapidly. Table I illustrates this in com- 
paring the exact solution of (8) with a one-term approximation (n = 0). The 
one-term approximation has been obtained by scaling it by &,/(1— r). This 
factor is derived by requiring that the potential of the plane x = 0 should 
equal AV/2. 
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V2 


Fig. 3. The potential distribution of a basic cell with a permittivity interface at y = y;. 


TABLE I 


Relative difference (in per cent) between exact solution of (8b) and a one-term 
approximation (silicon-dioxide-silicon interface at y = 0:02 L; L is the elec- 
trode length) 


A CCD can be thought of as consisting of a number of the basic units. 
Therefore, the solution to (1a) can be written as the sum of the solutions of 
the kind of (8a). Using the one-term approximation we obtain for a four-phase 
CCD with plate voltages Vi, V3, V4 and V4, electrode length L and for y > y,: 


95,3 (x, y) = 


1 
= =|. .+(%- VQ) (=- arctan”) +(V¥3— Vj) (=— arctan = ) + 
7 x x—L 


y 
2 e.]e 


The region of interest in the y-direction is limited to one or two times the elec- 
trode length. The series for the potential in that region will converge rapidly. 
It was found that two or three unit cells (8 or 12 electrodes for a four-phase 
device) give sufficient accuracy for a region near the centre electrodes and for 
a depth of one or two electrode lengths. The outer electrodes, which extend to 
infinity, are held at the average gate potential (zero in our examples). 

The field in CCD's without mobile carrier can be split into a component Ep 
due the electrode voltages (the externally induced field or fringing field) and a 


y 
x—2L 


) (V,— ra- arctan 
x 
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Fig. 4. Potential and field distributions in a homogeneous PCCD (bulk CCD) assuming zero 
space charge inside the semiconductor. 


component E, due to the dopant concentration. E, can be found by differen- 
tiating (9) term by term. Further on in this section we will derive an expression 
for Ea. 

The results obtained so far are illustrated in fig. 4: the equipotentials and the 
externally induced drift field (E,) of a four-phase uniformly doped PCCD with 
plate voltages of 5, 0, —5 and 0 V respectively are shown (the thickness of 
the oxide is 0:02 L; its permittivity (e) equals 3-8 while the permittivity of 
the semiconductor material, £,, is assumed to be 11-8). The maximum poten- 

` tial occurs at the silicon-silicon-dioxide interface under the 5-V electrode (like 
for a surface CCD). Generally E, possesses a component in both the x- and the 
y-direction, but under the middle of the 5-V and the —5-V electrode the x-com- 
ponent vanishes. At all other places Ep x will be non-zero and aid to the transfer 
of (negative) mobile charge carriers toward the 5-V electrode. 

We will now include the actual non-zero space-charge distribution in the 
silicon. The field strength E; as a function of the normalized distance to the 
electrode (y/L) is shown in fig. 5a for three values of x/L. The minimum ex- 
ternally induced drift field occurs for x — 0. Using this value of the field an 
estimation can be made of the speed of the device. This estimation is, however, 
rather pessimistic ?). Maximizing the minimum fringing field is necessary if 
high-speed operation is desirable. Z,,,, as a function of the normalized distance 
x/L for three values of y/L is shown in fig. 5b. 


Solution to part II 


We will now discuss the solution of (1b) with boundary conditions as given 
by (2a) and (3) to (5) using the depletion-layer approximation ?). The deple- 
tion depth in the p-substrate is assumed to be equal to the average depletion 
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Fig. 5. The x-component ofthe externally induced drift field (Er, 4) as a function of the distance 


to the electrodes; (a) for three values of x and (b) as a function of x for three values of y. 


depth (H + d,). The average electrode potential and the substrate potential 
are assumed to be zero. The donor-number density in the n-region Np and the 
acceptor-number density N, in the p-region are known figures. Therefore, the 
space-charge densities o; and o4 (the space-charge density of the oxide is as- 
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sumed to be zero) can be expressed as 
040) — 4 N0), 
e0) = —4 N4O), 


(10) 


assuming that both Np and N, are independent of x; q denotes the elementary 
charge. Equation (10) will now reduce to a set of three independent, one- 
dimensional, second-order differential equations. Integrating twice and deter- 
mining the two constants of integration (for each equation) by making use of 
.the boundary conditions yields 


9," = C; y, 
» y’ 
" í,[02,,, §1 E ` 
p" =- fay [Zart ayta (1A) D 
Pf » £5 £2 E2 
y1 ro 70 
Uu 3 , 3 
p= fay [ay +0- m, 
E2 E2 
» 2 »2 


where H is the average depletion depth in the p-substrate and C, an integration 
constant. H and C, are determined from the expressions given in the appendix. 
If the average gate potential pay is not equal to zero, Pay should be added to 
pı"! and p,". The field due to the dopant concentration E, can be found by 
differentiating (11) in the appropriate region. E, will depend like p," @ = 1, 2, 3), 
only upon y, due to the assumption that Np and N, are independent of x. 
Fortunately, this holds true for most CCD’s. 

The complete solution of the problem as stated in (1) with boundary condi- 
tions (2)-(5) is given by the sum of (8) and (11). 


4. Discussion 


The method as outlined in the previous section can be of great help in the 
analysis of the behaviour of both surface and bulk-layer charge-coupled devices. 
In this section we will present the results of an analysis of the potential distribu- 
tion of a uniformly doped and of a profiled PCCD. Also, some aspects of the 
input/sampling circuit of an experimehtal PCCD will be analysed in some detail. 

If we assume the dopant concentration in the p-region and the n-region to be 
independent of the distance of the electrodes, then 


0:0) — 0; and 050) es. (12) 


Now, evaluation of (11) is straightforward. The maximum potential is reached 
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for y = y, where - 


zi Dx oed 
Ym = d, +— C, (13) 
. 02 j 
and the maximum potential V, equals 
; E C2 
Vin = d C, Es j- (14) 
£2 02 


The potential distribution of a uniformly doped PCCD is shown in fig. 6. The 
following data were used to calculate this (in posuere the data related to a 
second example): 
— oxide thickness: 0-1 um; relative permittivity &; = 3°85. 
— thickness of n-layer: 5-0 um (45 um); relative permittivity e, = 11-8; do- 
pant concentration 7:5 . 101* cm~? (7. 101^ cm79); : 
— p-type substrate: dopant concentration 2 . 10*^ cm^? (5. 101^ cm7?). 
It has been found that Ymax = 3:1 um and V, = 6:1 V under the 0-V electrode. 
The depletion depth was calculated to be 11:34 um from the electrode-silicon- 
dioxide interface. Figure 6 shows that the potential maximum occurs near the 
centre of the n-layer. Therefore mobile negative charge stays away from the 
surface states (traps). One-dimensional studies +°) of bulk CCD's which in- 
clude mobile charge carriers have shown that the maximum charge-handling 
capacity can be found by assuming that all the charge is located at the potential 
maximum. The maximum charge-handling capacity of the uniformly doped 
PCCD is 7:8 . 101° cm7? (13-7. 101°) where we assume a 5-V difference be- 
tween neighbouring electrodes. The measured value (second example) is 
1:5. 10!1, A surface CCD could carry 11 (6) times more charge +°). 

It is important to know the field component E, in the y-direction because 
this can give (together with E,) information about how the charge packet will 
be transferred. Ideally, a charge packet should travel occupying the smallest 
possible area and covering the shortest distance. 


y(um) : 
Fig. 6. Potential and field distributions in a homogeneous PCCD (bulk CCD) assuming a 
donor-number density of 7.5. 1014 cm-? in the z-region and an acceptor-number density 
of 2.1014 cm~? in the p-region. 
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Fig. 7a. The field (Ey) of a homogeneous PCCD with space v ieu as a function of Lo for 
x = O(transferring gate), x = 0-4 Land x = Lfor Np = 7.1014 3 N,—5.10!*cm^?, 

dj — 4:5 um (curves 1), and Np = 7:5.101* cm^?, Ny —2. “io” cm^?, dz = 50. um 
(curves 2). 


The field component in the y-direction due to the externally induced field 
and the dopant concentration is shown in fig. 7a for three values of x/L; 
x = 0 is the middle of the transferring gate; x = L is the middle of the gate 
to which the charge packet is to be transferred. The curves labelled 1 relate 
to a p-type concentration of 5.10!* cm^? (as stated in example 2) while 
curves 2 were calculated using N, = 2. 1014 cm^? (example 1). The curves 
are broken at y/L = 0-8 and 1-0 because of the occurrence of the p-n junc- 
tion. The maximum field increases slightly for an increase of N, and the curve 
as a whole is shifted towards the electrodes. The dopant-induced field is linearly 
dependent upon y as can be seen from (11). The externally induced field behaves 
similarly except near the electrode gaps (x/L — 0-4) where we note large dif- 
ferences from linearity for y/L < 0-3. The field near the silicon-silicon-dioxide 
interface is much larger in the low-dopant substrate case. 

The transferring process can be analysed more closely by monitoring com- 
putationally the potential (and field) distributions. Because of the simplicity of 
the expressions for potential and field here derived, the computer time needed 
is moderate. We calculated, for example, the potential distribution of the uni- 
formly doped PCCD 24 equidistant times during one clock period. It was . 
assumed that the four clock voltages varied sinusoidally. The path of the poten- 
' tial maximum during transfer from the 5-V gate to the neighbouring 0-V gate 
is shown in fig. 7b. Only a small bump at $ = 2/4 occurs. As a measure for 
the area occupied by a charge packet we will use the equipotential (Vmax — 1) V. 
Figure 7c shows the equipotentials (Vmax — 1) V for several ¢ with normalized 
location of the maximum. It can be seen that the vertical dimensions of the 
equipotentials remain nearly constant. Therefore, we may conclude that the 
last electron will travel the shortest possible distance during transfer. 
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Fig. 76. The location of the potential maximum as a function of the clock phase. 


Fig. 7c. The equipotential (Vmax — 1) V for various clock phases. 


The potential distribution of a profiled PCCD is shown in fig. 8a. The data 

pertaining to this case are: i 

— thickness of the oxide: 0-1 um; 

— highly doped top layer of 1:0 um (0-6 um) with dopant concentration 
linearly varying from 150 . 101^ cm~> (200 . 1014 cm~?) to 2. 101^ cm~? 
(3.10!^ cm); 

— thickness of n-layer: 7-68 um (4-5 um), dope level: 2. 101+ cm-3 (3. 1014 
cm?) 

— p-type substrate; dope level 2. 101^ cm^? (5. 10!^ cm^?). 


Fig. 8a. Potential distribution in a profiled PCCD assuming that Np varies linearly from 
150. 101+ cm^? to 2. 101* cm^? and N, is equal to 2. 1044 cm- 3, 
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The data in parentheses relate to a second example. Figure 8b shows the field 
component in the y-direction under the transferring gate (x — 0) and under 
the gate to which the charge is to be transferred. The dashed curves relate to 
the data given above; the drawn curves to the data in parentheses. It is clear 
that E, decreases more rapidly in the case of the shallow-top layer. The signifi- 
cance of this can be seen more clearly in fig. 8c. The path of the potential 
maximum while being transferred is nearly a straight line in the case of the 
shallow-top layer while a garland-like curve occurs in the other case. This 
indicates that the charges will be dragged through a far larger area of semi- 
conductor material and will therefore increase the possibility of filling "bulk 


0 | 040608 1 12 
-4 -— m WL 
-8 
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Fig. 8b. The field E, (externally induced drift field and field due to non-zero space charge) 
as a function of y for x = 0 (under transferring gate) and x = L. 
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Fig. 8c. The y-location of the potential maximum as a function of the clock phase. 
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Fig. 8d. Equipotential (Vmax — 1) V for various clock phases. 


traps". Figure 8d shows that the (Fmax — 1)-V equipotential spreads out quite 
a bit in the significant (vertical) direction in the case of the 1-0-um top layer, 
while the shallow-top layer spreads much less. 

The charge-handling capacity can easily be calculated if we assume, as for the 
uniformly doped PCCD, that all charge is located at the potential maximum. 


Vin Pin Ain P An Pr 


Fig. 9. (a) Input and sampling circuit of an experimental PCCD. 
(b) Sampling and clock waveforms. 
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We obtain 4:5.10!! electrons/cm?. The measured value for the second 
example is 3 . 10!! electrons/cm?, which is of the same order of magnitude. 
Due to the fact that periodicity is not at all a requirement in the mathematical 
derivation we are able to analyse for example the input and sampling circuits 
of a CTD. An experimental input/sampling circuit of a PCCD is shown in 
fig. 9a. The electrode waveforms are shown in fig. 9b. In the calculation we 
approximate the input diffusion by a semi-infinite metal electrode. The equi- 
potentials in a uniformly doped PCCD at t = tọ are shown in fig. 10a. A con- 
ducting channel exists under the P,,, and A,,;, electrodes. This can be filled 
with mobile charge carriers until the 7-V equipotential is reached. The previous 
signal-sample well is located under the P, electrode. The potential distribution 


Va(H0V) — Prin (45V) An in+5V) P,(-5V) AloV) P, (5V) Ar (ov) 


p——————MÓ—— ED ar a GE AS TS, ET 
or 
+ 5 B "n 7 N 7 9 
y^ 
a). 
Va(10V) Pyi(-5V) . Atin(5V) Po(- 4-83V) Ax(-1-25V) Pife483V) — A; (1-25V) 


a 


Fig.10. The potential distribution inside the first part of a PCCD; (a) just before a sample 
is taken (t = fg), (b) just after a sample has been taken (t = 1). 
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at f = t, can be seen in fig. 10b. The clock voltages are varied sinusoidally. 
The conducting channel has been broken up now and a charge well has been 
isolated under the 4,,, electrode. The potential maximum under the Ay in 
electrode lies somewhat closer (3-05 um) to the silicon-dioxide interface than 
the maximum under the A, electrode (3-15 um) which would indicate a slightly 
larger maximum charge-handling capacity of the input circuit. 


5. Conclusion 


An exact yet very simple method has been proposed to evaluate CTD’s using 
two-dimensional potential and field distributions. The one-term approximation 
— with an accuracy better than 2% — can be evaluated cheaply and quickly 
on a computer. Non-periodical configurations as for example the input/samp- 
ling circuit of a CTD can be dealt with easily. A (pseudo-) dynamical analysis 
has been demonstrated with some examples. Indications about charge-handling 
capacity, speed and efficiency could be obtained. 


Eindhoven, November 1974 


Appendix 
Evaluation of depletion depth 


The average depletion depth H in the p-substrate of the device can be cal- 
culated by requiring that the total charge between the potential maximum, 
y = Ym, and the p-n junction y = y, is equal to the total charge between the 
p-n junction and the depletion depth. Furthermore, the voltage drop between 
the potential maximum and the electrode is equal to the voltage drop between 
the maximum and the substrate (assuming zero gate potential). 

For the uniformly doped PCCD we find 


Np (y — Ym) = Na H, (A.1) 
and 
d, Ym 1 
Nb Ym | — + — | = — [Np (V2 — Ym)? + N4H?] (A.2) 
& — 26, 2e, 
respectively. 


The integration constant C, turns out to be 


»2 pe H »'! H 
f dy” J (@2/e2) dy + f dy" J (035/62) dy + (y; — H) f (@s/€2) dy 
C = »1 »1 »2 »2 y2 


d; &[&, + di (1 — £1/82) (A 3) 


For the profiled PCCD (assuming a linearly decreasing dope concentration in 
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the Np,, top layer) we obtain in a similar way as above for yo < Ym < Ya 
(fig. 11): 


Noz (2 — Ym) = Na H, . (A.4) 
and 
d Ym d d 
Ny:2 Ym (2 + z) + di (Np,1 — No.2) (2 + =) 
& 2&8 £j 2e, 


Fig. 11. Cross-section (a) and dopant concentration (b) of profiled PCCD. 
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SOME CONSIDERATIONS ON THE IMPLEMENTATION 
OF DIGITAL SYSTEMS FOR SIGNAL PROCESSING: 


by 
T. A.C.M.CLAASEN, W.F.G.MECKLENBRAUKER and J.B.H.PEEK 


Abstract 


A generalization is given of a recently proposed approach for the im- 
plementation of digital systems for signal-processing purposes. This 
realization uses memory devices and accumulators instead of multipliers 
and adders. It is shown that this approach can be applied to very general 
types of digital systems. It is indicated that an exchange is possible 
between the amount of memory, the number of accumulators and the 
speed of the system. Several examples are given in which the concept 
is elucidated. 


1. Introduction 


In the procedure for constructing a digital system three different steps can be 
distinguished: 

(1) the analytical description of the system; 

(2) the synthesis of the configuration of the system; 

(3) the hardware realization. 

For a digital filter for example these three steps comprise: 

(1) the approximation of the desired filter characteristics by a realizable trans- 
mission function (or impulse response); 

(2) the determination of a set of equations realizing the transmission function 
under some additional constraints (e.g. no limit cycles, minimum sensitivity 
with respect to coefficient variations or minimum quantization noise); this 
set of equations will be referred to as the configuration of the filter; 

(3) the translation of the configuration into a digital-hardware realization. 

In this paper attention will be focussed on the third step. A well-known 
method of carrying out this step consists in a direct translation of the config- 
uration into digital hardware. In that case every multiplication and addition 
operation is performed by using digital multipliers and adders. The multiplica- 
tion factors are obtained from a memory, e.g. a read-only memory (ROM) and 
the results of the arithmetical operations are stored in registers +). 

In this realization the multipliers are the most complicated parts of the 
system, for which reason the aim is to minimize the total number of multipliers. 
For the sake of brevity this type of implementation will be referred to as the 
multiply-and-add (MA) implementation. 
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A different approach for realizing transversal digital filters has been given 
in refs 2 and 3 and has also been suggested for the realization of recursive digital 
filters in direct form ?:*). In this approach all multipliers have been avoided 
and are replaced by circuit blocks each consisting of a memory device (e.g. a 
ROM) and an accumulator (further referred to as RAC implementation). It 
has been claimed ?:5) that an increase of speed or a significant reduction in 
cost and power consumption can be obtained over existing realizations. 

Although we share the opinion that this approach can be advantageous in 
certain cases, the possible advantages with respect to conventional realizations 
will not be discussed here, because these will highly depend on the technology 
used. The aim of this paper is to show that this new approach is not restricted 
to filters in direct form ?:^:5), but can also be applied to other digital systems 
and to digital filters with more-general configurations. First a brief introduction 
will be given to the main ideas behind the RAC implementation. Next it will 
be shown that every linear time-invariant digital system can be described by a 
set of equations that can be implemented by means of the RAC approach. 
Finally some examples will be given of systems to which the RAC implemen- 
tation has been applied. 


2. Description of the RAC method 


In order to illustrate the RAC method for implementing digital systems the 
following relation will be considered: 


y(n) = a, x(n) + az x(n) + az xs(n). (1) 


In this relation y(n), x(n), x2(n) and x3(m) are signals occurring in the system 
at time n T and a,, a; and a, are system parameters. 

The MA implementation of eq. (1) leads to the structure given in fig. 1a. It 
requires three multiplications and two additions for the determination of every 
new sample y(r). Without loss of generality it will be assumed that the signals 
are represented by two's complement numbers with N bits: 

N-1 
x(n) = —xjo(n) + 5, xa) 27*. Q) 
i=1 
Here the numbers x,,() can only have the values 0 or 1. With this represen- 
tation of the signals, eq. (1) can be rewritten as 


y(n) = —[a, x1o(n) + a2 xao(n) + az xao(1)] + 
N-1 
+ X) daixu(n) + az xa) + as X3,(n)] 27! 
i=1 
N-1 
= —x(n) + } an) 27*. (3) 


i=1 
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Fig. 1. Implementation of one equation with three variables, (a) by using multipliers and 
adders, (b) using a ROM and an accumulator. 


In this equation «,n) (i —0, ..., N— 1), can only take 23 = 8 different 
values corresponding to the 8 possible three-digit binary sequences (x,,(7), 
x(n), x3,(n)). Therefore eq. (3) can be implemented as follows: the three bits 
x(n), x2,(n) and x3,(n) are used to address a ROM which delivers the corre- 
sponding value of «,(n) represented by K bits. After a shift over i bit positions 
(which corresponds to a multiplication by 277) this output word is supplied to 
an accumulator. The final value of y(n) is obtained after N consecutive addi- 
tions in the accumulator of shifted outputs of the ROM. 

The corresponding structure of the system is shown in fig. 15, where the bits 
x(n) (i = 0, ..., N — D), are applied serially to the ROM, and similarly for 
x(n) and x3,(n). In general every linear equation 


y(n) = » a, x,(n) (4) 
j=1 


can be implemented in this way. This requires a ROM for 2M words of K bits, 
where K is determined by the number of bits by which the system parameters 
are specified, and one accumulator, which produces the output word after N 
consecutive additions. 

The exponential growth of the size of the ROM with the number of variables 
in the equation has always been considered to be a fundamental drawback of 
this implementation. Moreover, the necessity to perform N additions in the 
accumulator during every sample period 7' = 1/f, is a limitation on the maxi- 
mum allowable sample frequency f, of the system. 

Both limitations can be relaxed on the expense of hardware complexity. This 
will be explained briefly below. A reduction of the required size of the memory 


76 T. A. C. M. CLAASEN, W. F. G. MECKLENBRAUKER and J. B. H. PEEK 


can be obtained by partitioning of eq. (4) into P parts ?): 


ynm-—Yax(m4a X ax(ma...H YE ax (5) 


J=1 J=My+1 J=Mp—i+1 


where M, < M; «...« Mp., < M may correspond to any suitable sub- 
division of M into P different parts. Every sum in eq. (5) can now be imple- 
mented by the method described above, and y(n) is then obtained by summing 
the P outputs of the subsystems. The corresponding structure is depicted in 
fig. 2. The bits of the signals x,(m) through x, ,(r) are used to address the first 
ROM and as in fig. 1 these bits are applied to the ROM serially. The second 
ROM is addressed by the bits of the signals xm,+ı(n) through xm,(n) and so 
on. Thus typically the size of the ROMs now is 2V/?, and P of these ROMs 
and P accumulators are required, while still N additions must be performed in 
every accumulator per sample period 7. 

To show the advantage of such a partitioning consider an equation with 10 
variables. Without partitioning the implementation requires 1 accumulator and 
a ROM with 21? = 1024 addresses which might very well be prohibitive to 
realization on a chip. If this is the case, then a partitioning of the equation 
into two equations with 5 variables each would require 2 accumulators and 
2x25 — 64 ROM addresses and one adder to combine the results. It is of 
course also possible to take P = M, which: results in the conventional MA 
implementation since in that case every ROM-accumulator combination is 
equal to a conventional multiplier. 

A reduction of the computation time can be achieved by splitting up eq. (3) 
into Q parts ?:5): 


Fig. 2. Implementation of one equation with M variables with the RAC method after a par- 
titioning into P subequations. 
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N 
y(n) = —e«(r) + X a(n) 27! + y «(n2 -...c  «(2-" (6 
i=Ny+1 izNQ-i1t1 
where N, < Na <... < No_,< N may correspond to any suitable sub- 


division of N into Q parts. 

As before, every sum in (6) can be obtained by addressing a ROM with the 
bits of x,(”) through xy(n), but now such that the bits x,,(m) (i = 0, 1,..., N,) 
are used for the first ROM, and x,,(n) (i = N, +1, ..., N2) for the second 
and so on. The resulting structure is similar to that in fig. 2 and the Q ROMs 
of 2™ addresses each have now the same content. Q accumulators are required 
but every accumulator now performs only N/Q additions per sample period T. 
In the limit where Q — N the accumulators are not necessary because the 
computation is finished after one access to the ROM and the subsequent addi- 
tion of the Q partial results. 

The various trade-offs discussed above are summarized in table I: horizontally 
the consequences are indicated of a partitioning of the equations as in (5) and 
vertically of a splitting as in (6). Clearly an optimum of this procedure can only 
be determined if precise specifications of the system to be designed and of the 
technology used are available. Whatever solution will be taken, the resulting 
structure will be referred to in the following as a ROM-accumulator combina- 
tion (RAC). 


TABLE I 


ROM addresses 2M 
no. of accum. 1 
no. of additions 

per accum. N 


ROM addresses P2M/? 
no. of accum. P 


reduction of ROM size 


no. of additions 


increase of no. of accum. 
per accum. N 


reduction of | increase of reduction of |increase of 
computation | ROM size, computation |ROM size, 
time no. of accum. time no. of accum. 


ROM addresses Q 2M 
no. of accum. Q 
no. of additions 
per accum. 


ROM addr. PQ 2M/P 
no. of accum. PQ 

no. of additions 

per accum. N/Q 


reduction of ROM size 
> 


increase of no. of accum. 
N/Q 


3. RAC implementation of a digital system 


In the previous section only one equation with M variables has been con- 
sidered. In the sequel it will be shown that the method outlined can also be 


p 
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applied to a general linear time-invariant system. It will be assumed that the 
configuration of this system has already been chosen. 

Let the system be excited by input signals u,(m) (i = 1, ..., I) which to- 
gether constitute the column vector u(n). In the same way the output signals 
v,(n) (j — 1, ..., J) can be combined to form a column vector v(n). 

In the following description of the system it is essential to identify two types 
of signals: 

(1) Signals that are results of a sequence of ahmei operations and are 

* used either to form an output or be stored in a register. These will be de- 

noted by y,(n) (k = 1, ..., K) and form the vector y(n). 
(2) Signals that are used as inputs to the arithmetical operations. These are de- 
noted by x(n) (7 = 1, ..., L) and constitute the vector x(n). 
With the signal vectors defined above the system may be described by the set 
of equations 


y(n) — A x(n), (7) 
u(n) 

= B(F)| ---- |, 8 

x(n) ey | (8) 

v(n) — C(F) |- | 2 (9) 
y(n) 


In these equations F is the backward-shifting operator defined by 


F y(n) = y(n—}), 
F* y(n) y (n— K). 


The matrix A includes all arithmetical operations that must be performed in 
the system. This means that to form x(n) and v(n) from y(n) and u(n) no ar- 
ithmetical operations are necessary. Consequently the matrix operators B(F) 
and C(F) must be such that in each row only one entry is different from zero. 
This nonzero entry equals F* where k is any non-negative integer. If desired, 
simple operations such as multiplications with + 2" can also be taken into 
these matrix entries as a factor since they can be implemented with very little 
hardware. 

Equations (7) through (9) are uniquely determined by the configuration of 
the system. Equation (7) can be implemented according to the RAC approach 
by considering every row individually and proceed as in sec. 2. 


4. Applications 


In this section the ideas behind the described method will be elucidated by a 
number of examples. 


- 
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Example 1 ` 


As a first example the sine-cosine generator described in ref. 6 will be con- 
sidered. The structure is given in fig. 3a. There is no input, but 2 outputs. With 
the terminology introduced in sec. 3 this system is described by 


»9-| 5 AED 


x(n) = B ME 


v(n) = y(n). 


In the MA implementation this generator requires 4 multipliers and two adders, 
whereas it is clearly seen that with the RAC implementation only two ROMs 
each having 2? = 4 addresses and two accumulators are required, see fig. 3b. 
While this may lead to a reduction of required hardware it should be observed 
that a change of one of the system parameters requires an entirely different 
content of the ROM. This feature is characteristic of the RAC approach 5) 
and therefore it is e.g. not very suitable for the implementation of adaptive 
systems. 


Fig. 3. Implementation of a sine-cosine generator, (a) with the MA approach, (b) with the 
RAC approach. 
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. Example 2 


In this example a wave digital filter with a unit-element reference filter will 
be discussed. This filter is depicted in fig. 4. The boxes S, through S, are so- 
called adaptors in which the arithmetical operations take place. A detailed de- 
scription of wave digital filters and their properties can be found in ref. 7. 

In this filter eleven signals have to be computed and are denoted by 
yi(r), ..-, yii (n). Eleven different signals are used to compute these signals: 
x(n), ..., x1,(n). The adaptors will be characterized by their 3x3 matrices 
Si, S, and S, and the 2x2 matrix S4. The system is described by eqs (7)-(9) 
with 


1000000000000 
000F000000000 
00000F0000000 
0000100000000 
00000000F0000 

B(F-|0000000000F00|, 
0000001000000 
000000000F000 
0000000100000 
00000000000F0 
0010000000000 


0010000000000 
bs ooocosstesPi]: 


Fig. 4. Wave digital filter with unit-element reference filter. 
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> 


The matrix A has many zero entries and it is advantageous to realize the com- 
plete system by implementing every adaptor equation separately by the RAC 
method. In that case every adaptor consists of three (only two for S3) ROM- 


accumulator combinations. In contrast to the original concept of adaptors `` 


proposed by Fettweis this approach does not require a distinction between 
series and parallel adaptors 7). A difference is only found in the content of 
the ROM. This reasoning applies to all wave digital filters. It can also be 
noted that the property of losslessness is preserved in this implementation, 
since every adaptor produces again the same signals as in the MA approach. 


Example 3 


As a third example various configurations of a second-order digital filter will 
be discussed. These are the transpose direct form 2, the direct form 2, which are 
both canonic configurations), and the non-canonic direct form 1 used in ref. 5. 
The corresponding arithmetical operations are given by the following matrix 
equations and the RAC implementations are shown in fig. 5: 


a, a, 0 0 
Yaln) = [403 G4 1 0 x(n), 
as 0 0 1 


Fig. 5. Three possible RAC implementations of a 1 second-order digital filter; in direct form 2 
(transpose), (b) direct form 2, (c) direct form 1. 
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[1 b 5.0 
nod =[ 5 b, b, p, [m 


yen) = [e c2 c3 c4 cs] x(n). 


All coefficients a,, b;, c, can be determined from the transmission function. 
The realization shown in fig. 5b demonstrates that not all signals that are stored 
in registers need be outcomes of an arithmetical operation. Last but not least 
it is worthwhile noting that the non-canonic form in fig. 5c is optimum in the 
sense of minimum number of accumulators. 


Example 4 


As a last example the ladder structure of Mitra and Sherwood ?) will be 
considered. A four-stage filter is shown in fig. 6a and according to the defini- 
tion of sec. 3 the following description is obtained: 


a, 0 az 0 0 
0 a 0 a, 0 
y(n) =} 0 0 a3 0 as xa(n). 
0 0 0 a, —üs 
Q, 424g da3ü4 A4dg Qsdo 


The RAC implementation of this filter is given in fig. 65. In the MA imple- 
mentation 9 multipliers and 8 adders are required. As can be seen from fig. 6b 
5 RACS suffice to realize the same filter. Due to the occurring chains of two 
multipliers the word length may increase too much. If this must be avoided then 
quantizers must be placed at the nodes indicated in fig. 6a. These node signals 
must be considered as auxiliary outputs resulting in the following equation: 


a,0000010 0 
00000101 0 
00000010 1 
00000001-4 
y(n) =| a, 0 0 O O a6 a, ag as |X (n). 
0a000000 0 
00a00000 0 
000a0000 0 
0000a,000 0 


From this equation it can be seen that the signals at the encircled nodes only 
depend on one signal each, and therefore the RAC implementation of this part 
of the filter is identical with the MA implementation and requires 5 multipliers 
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b) 


Fig. 6. Ladder structure by Mitra and Sherwood; (a) with MA implementation (the encircled 
nodes are possible places for quantizers), (b) RAC implementation. 
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and 4 adders. One ROM-accumulator combination is required to form the 
signal y,(n). - 
5. Conclusion 


It is shown that the ROM-accumulator approach proposed in refs 2-5 for 
the implementation of digital filters in direct form can also be used for the 
implementation of general digital systems. The advantages and disadvan- 


tages that might accrue from this new implementation depend highly on the 
specific structure of the system under consideration and of the technology used 
for the actual realization. 
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HURWITZ SEQUENCES OF POLYNOMIALS 


by Y. GENIN 


Abstract 


The algebraic properties of the ,Polynomial recurrences P;4.3(p) = 
= p P,(p) + cj? P. (p), where cj? are positive constants, are reviewed 
and discussed. Such sequences of polynomials are shown to satisfy 
some algebraic relations of the same type as the orthogonal poly- 
nomials and to play a central role in the problem of reducing a stable 
matrix by similarity to some canonical forms. 


1. Introduction 


It is well known that any Hurwitz polynomial P,(p) of degree n can be im- 
bedded into a unique sequence of Hurwitz polynomials P,(p) defined by the 
recurrence relation 


Pi+1(p) = p Pp) + ej? P, i(p) 


where the c,? are positive constants. 

In this paper, a discussion of the main algebraic properties of such Hurwitz 
sequences is presented; in particular, it is shown that the polynomials P,(p) can 
be obtained in closed form as functions of the coefficients of P,(p) and that 
two relations of the Christoffel-Darboux type are satisfied by the sequence. In 
addition, the continuant matrix associated with the above polynomial recurrence 
is proved to have the Schwarz form or the Puri- Weygandt form depending on 
the normalizing factor of the sequence. Finally, the similarity transformations 
reducing any stable matrix A to a direct sum of Puri-Weygandt blocks are . 
investigated and the Hurwitz sequence associated with the characteristic poly- 
nomial of the matrix A is shown to play a central role in the solution of this 
problem. 


2. The Hurwitz sequence of a Hurwitz polynomial 
Let P,(p) be a monic real polynomial of degree n: 
P,(p) = Y bi p". : (1) 
i=0 


If P,(p) is a Hurwitz polynomial, its coefficients b, are known to satisfy the 
Routh-Hurwitz constraints (k = 1, ..., n): 
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b, 1 
b b, ^ b 1 

4, — |.* 1 2 >0 (2) 
bak- 1 bak- 2 b, 


so that the constants cg, c,?, ..., c, ,? defined by 
0 


Co =A,, (3) 
its Ai. Aa 
AE S 


are all positive real. It is then known ':?) that the polynomial P,(p) can be 
imbedded into a sequence of Hurwitz polynomials P,(p) of degree i verifying 
the recurrence 


Pi+:(p) = p Pp) + c? Pi-1(p), i=l,...,2—-1, (4) 
initialized. by 


P =]; 
o(P) (5) 
Pi(p) =p + co. 
Since by dividing (4) by P,(p) one has 
c 2 
Pus (PP) = p + —— — — (6) 


P,(p)/Pi-1(p) 


this sequence of polynomials enjoys the important property that the ratio 
between any two successive polynomials in the sequence is a positive real 
fraction. 


3. Determination of the Hurwitz sequence associated with P,,(p) 


Let us consider the polynomial A (p) of degree k: 


b, 1 
b, b, b, 1 
Ap) = (-D*| : (7) 
boi bui bi 
p* Ec (-1)* 
obtained from A,,, by replacing its last row by the row vector: 
[—1)* p*, (—1)*-? p*71, LIIS ;—D 1]. (8) 


If the cofactor of the entry (i, j) in (7) is denoted by Ai,» one has by the Jacobi 
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theorem ?) 
EP Ák+1,k+1 ru Annet Aki = (—)y*t Ax» A,(p). (9) 
Since the following values for the 4, , are easily deduced from (7): 
Ags = (—1)"+! A,-2(p), 
Ags 1,k+1 = Ay, (10) 


Ag ket = p A. (p) 
ETE . = (—1) A-1 


(9) can be rewritten as 
Ap) Pp A,-1(P) Ar-3 Ax A,--2(P) 


BE , (11) 
| ALS A Agin ded 
which, by comparison with (3) and (4), proves the identity 
Ap) 
P(e) = — (12) 
Ax-1 


and provides at the same time a way for calculating the Hurwitz sequence 
associated with P,(p) as a function of the coefficients of P,(p) only. 
Incidentally, note that eq. (12) gives for k =n: 
Ap) 
Pig) = -4— (13) 


n-1 


so that, if H stands for the nx (n + 1) matrix, 


b, 1 
b, b, b, 1 
ma et (14) 
b,.s 
b, b, 


and H” for the square matrix deduced from H by deleting its jth column, 
one has the determinal identities 


[aro = bj-1 Ass. (15) 


4. Christoffel-Darboux formulas 


If AC), ACP), fo(p), ... is any sequence of orthogonal polynomials, the 
polynomials f(p) are known ^) to verify the so-called Christoffel-Darboux 
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formula, i.e. 


: oe DS AD KD fo-0) 
5 ; i . " és . fa H p PR q d 


(16) 


i=0 


where the h, and k, are some constants. It will be shown hereafter that two 
similar relations are satisfied by a Hurwitz sequence of polynomials. 
To see this, consider the recurrence (4) rewritten as 


Pi+1(p) = P,-1(2) PD) (17) 
cjp c. p 7 ej 


By multiplying both sides of (17) by (1)! P(q)/q c1? c5? .. . c; -,?, one has 
Prss(p) PQ) | Pr-1(p) P, Pp) P, 
ic »[ (P) P) Pi-1(P) Pal) |- m yore e (a) 


2 2 2 2 2 (18) 
€, -..C Pd = Cyt... Ci pq eq 
or vers by permnting p and q, 
P P P,- P P,(p)P 
Cy E tt. 1@) (D 1~1(9) (p) |- (1! n ii . 09 
ies O pq. y uoces pu C17. Pp 


os (19) from (18) and summing for i = 0 up to n — 1 yields 


i P, Piri P, xd Pisi 
C3 y On 2- Ye jp 
e^ pg 


Pip) Pi-1(Q) — Pii P, 
P (p) P:_1(9) (p) e] (20) 


- <C- pq 
and finally 


-1 


E Cay pum py: P,(P) Pn—1() — Pn- (P) Pala) 


21 
cuu e 


i-0 


In particular, if p; and p, are two distinct zeros of P,(p), one deduces 
P P : 
> x (1 cT to) (CN A Q2) 


Gt eq. | QD is ‘multiplied by p—q fad: differentiated rtimes with respect to 
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p, and s times with respect to q, one obtains - oe. 
n-1 ee: oe Y . 
P,@71) p,G P,© )P =q) 
, P Cc) i (p) P, NN ; c» 1P (p) P, LR 
€2...e0? e? ...c P» Y 
i-0 i-o à 


A Pr Pj f 7 : 
+ (p—4) =) ee 2 = 
Cy ^... Ci 
, P OD) P, X4) — P, ., (p) P(g) ^ | d 


- Cy "Q3 


2 2 2 
617657... 05-4 


Hence, if the multiplicities of p, and p, as distinct zeros of P,(p) are larger than 
r and s respectively, the following relation is derived from (23) by recurrence: 


n-1 
p,o p,G . god ug ^ 
p? c (py) Pi (px) oH 4 IDEM 
Oy? 024 xn CY E 2 
i=0 U 


4 


For p — q, (23) simplifies into 
n—-i n-1 | l tol 
P,@-) p,G p, P,G-1) 
y (—1) i (p) P(r) a y = 1 (p) P, (p) E 
l | C17... e;? C17...¢;/7 
i-o i-o 


P,OX(p) Pa 18Xp) — P,©(p) P, Xp) 


2. 2 
€1* 7. Cam 


Q5) 
=(=)": 


Again by recurrence on r and s, its solution is easily found to be (s < m) 


n-1 ! 
Ps) P,® 
Ye» Pp) PMP) _ 
e? ...cj? ' 
i20 


. (26) 
n-1 (m — s)! s! Nx PA? (p) Pai O( p) — P, (p), , "7 ** 9(p) 
dur e) eain rir E 


t! (m— t 4- 1)! 
t=0 
ex "6 


In particular, if p; is a k,-multiple zero of P,(p), one derives the felation «<+ 
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n—-i 


e ye Cay P; ™-(p;) PO) Z 
(m—s)!s! c?...c 


iz0 
0 for k,—12m, 


m-kjt1 


Pom tt) P 2 (t) 
» Pm D Pan PD eae 2(k;—1) 2mzk,—1. 
(m — t 4- 1D! d! 


t=0 (27) 


(—1)""? 


Note that, in the above equation, the right-hand member does not depend 
on s and therefore takes the same value for all values of s compatible with 
the constraints 


O<s<cm<2(k,—1)). (28) 


By multiplying both sides of (11) by P,(—p) and retaining the even part of 
thé result, one derives the relation 


Py s(p) P(—p) + Pra s(—p) Pp) = cj Pp) Pi-1(—p) + Pi-1(p) P(—p)] (29) 


which proves, in particular, the invariance of 


P,(p) P,-1(—p) + P(—p) Pn—1(p) = 26 C1?» - 6,7. (30) 
Hence, at a zero p, of multiplicity k;, P, (pj) takes the value 
29 0)? ... 64.1? 
ie) = — —, (31) 
P.(—pj) 


so that the first non-trivial equation (27), i.e. m = k,— 1, can be expressed as 


n—-1 
| > cay BEC» no) 
s!(k,— s— 1)! e... m 
i20 


(-1)"-* P,*)(ps) Py. 1(pj) 
e e s Ead k,! 
P,“)(pj) 
k,! P, (P3) 
2cy d*'/ P, 
eer E P) ) f (32) 
k! dp*s P,(—?) p=Dy 


In addition to (21), a Hurwitz sequence of polynomials obeys another 
formula of the Christoffel-Darboux type. By adding (18) to (19), multiplying 


=> (—17: 2co 
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the result by (—1)! and summing for i — 0 up to n — 1, one derives the relation 
n—-i 


J Pp) Pa) E 2co a P,P) Ps i(q) + P4-1(p) Pr(q) 


61^. : p+q C1? C3? o o eb pat (p+4) 


eee) 


‘êi 
i=0 


which again has the same structure as (16). In particular, if P; and p, are two 
zeros of P,(p), (33) becomes 


n-1 
Pip)P 2c 

2. Ra ees 0 (34) 
Ci”... Oy Py + Pk 

i=0 


By using the same argument as the one leading to (23), one can easily establish 


the relations 
J 


n-1 " 
P,*Xp;) P,G | 2c (r+ s)! i 
2 (Dy) Pi (px) = (yt o ) (35) 
G oer (pj py : 

1-0 


provided the multiplicities of p, and p, as zeros of P,(p) are larger than r and 
5 respectively. 


5. Hurwitz sequences in matrix form 


It is a classical result in matrix algebra ?) that a continuant matrix can be 
associated to any continued fraction. Since the polynomials P,(p) of the con- 
tinued fraction (6) are connected through the recurrence (4), one has: 


(1, P,(p), poe ode J P,~1(P)] (Pin + B) = [0, 0, $2 39 0, P,(p)] (36) 


where the continuant matrix B, 


, (37) 


—l 


is said to have the Bückner form ?) (circuit theory) or the Schwarz form $) 
(control theory). By defining the diagonal matrix C as 


C= Cy C2 (38) 
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the matrix B can be similarly transformed into 
H = CB C~! (39) 


where the matrix 


T= — (40) 
Ca—1 
—Cg-1 


has been first introduced in ref. 2 and will be called hereafter a Puri- Weygandt 
matrix. 
. If (36) is differentiated k — 1 times with respect to p, one obtains the system 


p Orie 0 P,(p) 
lp P,(p) 
2p D(p, k) + D(p, k) C! I1 C = (41) 
(k—1)p Ose ee 0 P,*?(p) 


where D(p, k) is the following k xn matrix: 


1 Pp * es. es Py-1(p) 
a) (1) 
Pci Pp) sso Fee UD 


Since the first matrix in the left-hand member of (41) can be similarly trans- 
formed into a Jordan block: 


p p —1 
1 p p — 
-1 = 
zu m 7,— "e (43) 
(k — 1)p p 
by the transformation T, defined as 
1 
. —l 

T, = 2! (44) 


(—D*-1 (k — 1)! 


the following matrix equation is satisfied at any zero p, of multiplicity ky 
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of P,(p): 


T, D(py ki) CH = E T, D(p, ky) C^*,(45) 


—PDi 1 
—Di 


which proves that the eigenvalues of a Puri-Weygandt matrix I are the zeros 
of P,(—p). Moreover, II is totally defective since the cofactor of the entry 
(1, 2) in (p 1, — 77) is different from zero for all values of p, and hence is 
similarly transformed into its canonical Jordan form 


J= THT! (46) 
by the transformation 


T4, D(pi; ki) 
T= fu D(p;, ki) > (47) 


where pi, P2, ... are the zeros of P,(p), assumed to have the multiplicities 
kis Kz, .. . respectively. 


6. Reduction of a stable matrix to a Puri-Weygandt form 


A matrix A is said to be stable if and only if its characteristic polynomial is 
a Hurwitz polynomial. In ref. 9 it is erroneously claimed that any stable matrix 
. can be transformed into a Puri-Weygandt matrix by a real similarity trans- 
formation; moreover, the transformation matrix is explicitly given as the prod- 
uct of two transformations, A being first reduced to its companion matrix 
form. In this section it will be shown that a stable matrix can always be trans- 
formed into a direct sum of Puri-Weygandt matrices by a real similarity trans- 
formation, which in addition can be obtained in one step with the help of the 
Hurwitz sequence associated with the characteristic polynomial of A. 

It is well known ?) that two real matrices are similar if and only if they have 
identical invariant polynomials or equivalently identical elementary divisors in 
the field of the real numbers. Since a Puri-Weygandt matrix is totally defective, 
so must be A. Let then [1, P,(p), P2(p), ...] be the Hurwitz sequence associ- 
ated with P,(p) = det (p 1, — A). If the transforming matrix X, 


A X = —X II, (48) 
is written as 


x= [x;, X22.” Xn] s (49) 
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the n vectors x, must clearly satisfy in view of (40) 


AX, + co X1 — C1 X; = 0, 


(50) 
AX, + €, Xii — 6 Xi, = O. 
Hence, the vectors x, up to x, can be expressed as functions of x, by 
P,(A) 
Xi. 1 Rs 9 i—1l,...,n—1, (51) 
€1€25... Gy 
so that X becomes 

X = pa, P) xi, PD) x1] C7? (82) 


where x, is a vector such that X is non-singular but otherwise arbitrary since 
the last equation (50), 


A Xn + C424 Xy, = O, 
implies 


A i (A) X, Pi- 2(4) X, P,(A) X, 
— t E 


Cy Ca. ee Cn-1 Cy Co... - Cn-2 C1 C5... Cay 
=0, (53) 


in view of the Cayley-Hamilton theorem. As a result, any real stable matrix A 
has thus been proved to be similar to a Puri-Weygandt matrix if and only if it 
is totally defective; moreover, the transformation depends on n arbitrary param- 
eters, namely the components of x,. 

Since a real stable matrix A is in general not similar to a Puri- Weygandt 
matrix, let us investigate the possibility of reducing A by similarity to a direct 
sum of Puri-Weygandt blocks: 


AX — —X IT', 
W =, +i, +... + Th. 
All the IZ, being totally defective matrices, necessary and sufficient conditions 
for the existence of such a transformation are clearly: 
(a) det(p1,— 4) = det (p 1, + I7"); 
(b) P;,n,(p) = det (p 1,, + 7j) — where n; is the dimension of the jth block — 
is a product of elementary divisors in the field of the real numbers of A 


corresponding to distinct zeros. 
By defining k Hurwitz sequences of polynomials: 


P; aP) — p + Cjo 
Pjasi(P) =p P; (p) + cj Pja-1(P) j=l,..., k; i=1,..., Maly 


(54) 
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the same argument as above can be used to prove that X is composed of k sub- 
matrices X, of dimensions nx; of the type 


X, T [x;, P; (4) Xj.. P,5,-1(4) x] Crt, (56) 


where C; has the form (38). The k vectors x, must be independent and chosen 
so as satisfy 
P, (4) x, = 0, (57) 


which is always possible since the rank of P,,,,,(A) is at most equal to n — n;. 

Hence, reducing A by similarity to a direct sum of Puri-Weygandt blocks is 
always possible and the number k of blocks needed is at least equal to the 
number of invariant polynomials and at most to the number of elementary 
divisors of 4. 

Note that for k = k,;,, IZ’ is not unique since the invariant polynomials 
assume some divisibility relations among them which need not be satisfied; on 
the contrary, for k = Kya, each P; , (P) is necessarily equal to one elementary 
divisor of A, and JZ’ is therefore unique within an arbitrary permutation ef 
the blocks. It can also easily be proved that 
(a) kmax = n if and only if all the eigenvalues of A are real and distinct so 

that one has at the same time kyu, = 1; 
(b) kmax = Xmin = 1 if and only if A is totally defective and has only one eigen- 
value, which is therefore real. 

Since, from the above results, it turns out that a Jordan block is similar to 
a Puri-Weygandt block, let us compute the explicit form of the entries of this 
block in the cases of a real eigenvalue and of a pair of complex conjugate eigen- 
values. . 

Consider the symmetrical Jacobi polynomials of degree i and index n defined 
as *) D 


Ri'~"(p) = 


G 4- r)! G 4 n)! (=) (58) 


ir!G—ri@at+r)! 2 


and which are known to be orthogonal and to ae the recurrence relations 


2i+ 1 j2 
LT SRE) de REM). (59) 


+1 (P) = TE TENET, 


For i — n, one deduces 
n 


n,—n |O27Qn— DM n\ (p—1\ 
ia aera? | or 


r=0 


2n—1)!! ; 
at = x cus (60) 
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so that one has 


Pap) = (61) 
if the monic polynomials P,(p) are defined as 
i! 
Pp) = ———— — Rr" (p). (62) 


Qi — 1)! 


Hence, the entries of the Puri-Weygandt block associated with the Jordan 
block built on the real eigenvalue p, are found to be ?) 


Co = Hh pi (63) 


Let us now consider the complex conjugate case 


Pp) = [p? + (pi + pi) p + pi bi* (64) 


By normalizing p to the modulus of p,, one has 


Palp) = (p? + 2p cos 0 + 1)" 
2n 


= Y, C; "(—cos 0) p', (65) 


i-o 


where the C;"(—cos 8) are the Gegenbauer polynomials of degree i and 
index —n (ref. 4): 
i 


n n 
C; "(—cos 0) = ( ) (, ) cos [(i — 2j) 6]. (66) 
J/ \t—] 


J=0 


The entries dj? of the associated Puri-Weygandt block can then be obtained 
by evaluating the Hurwitz determinants of the polynomial (65), which leads 
to complicated rational functions; as an example, the first dj? are found to be: 


do = 2ncos 6, 
2 


jem = 4 cos? 6, 
d= n?—1 RET 1 + [(n? — 4)/3.5] 4 cos? 0 l i (67) 
3 1+ [(@? — 1)/3] 4 cos ?0 
n? —4 1 + [(n? —9)/5.7]4cos? 8. 1 + [(n? — 4)/3.5] 4 cos? 0 


ds? = 4 cos? O ee Er eC p aM 
3.5 1 + [(n? — 4)/3.5] cos? 0 1 + [(n? — 1)/3] 4 cos? 8 
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The dj? can alternatively be obtained by the following procedure. Since it is 
known !) that 


++p)" p 
pe a wl qi 
(P1*—(—1* « Ne á |< 


p+... (68) 


Co c5? 


the transformation p = (p? + 1)/2p cos 0 must produce the two continued 
fractions 


(p? + 2p cos 0 + 1)" + (p? —2p cos 0 + 1)" 


(p? + 2p cos 0 + 1)" — (p? —2p cos 0 + 1)" i 


241 Co pP? +1 ce? pP +1 
ee p ee +1 ug +... (69) 
2c p cos 0 c1? 2p cos 0 Co C5? 2p cos Ü 
p do d,? 
=— +1 | —p +1 T... 70 
d, IE ia? (70) 


which prove that the d,? can be deduced from the cj? (eq. (63)) by transforming 
(69) into (70). This problem is classical in network theory and amounts to 
derive the Cauer realization of a lossless 1-port from its band-pass realiza- 
tion 5); its solution can be obtained iteratively as described in the appendix. 


Appendix 


Assume the band-pass realization of a lossless l-port of degree 2n to be 
known: 


Ly Cı p? +1 L, Cap? +1 L, Cap? +1 
oa [aS [Aer 


ipio. AD 
Cip L, p C3 p 


and consider the problem of deducing the Cauer realization of Z: 


zZ=np+i uri | arti ret... (A.2) 


from the elements L, C,. Since from (A.1), one easily derives 


Z=Lp+1 e Ca pil (A.3) 
= 11D Go . 


which yields 4, = Ly, y, = C, C,/(C, + C3), the Cauer realization of Z can 
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be obtained iteratively, provided the elements L; C,;, of the band-pass 
realization of Z, are determined as functions of L, Cy. By using elementary 
Norton transformations, the relations between the L, C, and the Lii; Cj, 
are readily found to be 


Lzi-1,1 Liss 


L3 Lyi 
C211, ZA Ci 
C Cu 
2051 2142 (A.4) 
L. C (C me C ) Li Lui 
21—1,1 *2i-1,1 = \Y2i-1 2) 5$ 
Lj + Lii 
à Coins Cora 
Lois C211 = (L5, + La. 1) y E ERE) 
Coins + Carre 
initialized by 
a (A.5) 
1,1 = . . 
C; + C; 
MBLE Research Laboratory Brussels, February 1975 
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INTRODUCTION AND REMOVAL OF 
HYDROXYL GROUPS IN VITREOUS SILICA 


PART I. INFLUENCE OF THE MELTING CONDITIONS 
ON THE HYDROXYL CONTENT IN VITREOUS SILICA 


by G. H. A. M. van der STEEN and E. PAPANIKOLAU 


Abstract 


Bubble-free vitreous silica was obtained by melting crystalline SiO, 
powder at atmospheric pressure in an H} (H2/He) atmosphere at 
1950 °C. The hydrogen dissolves with hydroxyl groups being formed 
according to the reaction 


3 SiO2 + Hz xx2 zSiOH + SiO. 
liq. gas diss. diss. 


When the temperature is lowered the equilibrium shifts to the left which 
may give rise to bubble formation. Reactions between H20 and vitreous 
silica result in physically identical hydroxyl groups according to the 
reaction ; 


2 SiO2 + H20 «x2 =SiOH. 
liq. gas diss. 


At the melting temperature (1950°C) an equilibrium between the 
hydroxyl concentration in the glass melt and the hydrogen partial pres- 
sure of the surrounding gases is prevented by a reaction on the glass 
surface: SiO? + H2 — SiO + + H20}. The hydroxyl concentration 
in vitreous silica is determined by 

(a) the H, partial pressure of the surrounding gas during melting; 

(b) the H20 partial pressure of the surrounding gas during melting; 
(c) the sintering properties of the raw material; 

(d) the thermal history of the glass; 

(e) the water content of the raw material. 


1. Introduction 


Previous investigations have shown that it is possible to introduce water and 
hydrogen into vitreous silica in the form of hydroxyl groups, by treating the 
glass with water-containing or hydrogen-containing gases at elevated tem- 
peratures 175). These reactions appear to be mainly diffusion-controlled with 
two different diffusion coefficients. 


, 
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In the earlier investigations use was made mainly of commercially available 
vitreous silica, in which the hydroxyl concentration may vary from practically 
zero to approx. 0-1 weight ?/. These differences in hydroxyl content must be 
ascribed to differences in: 

(1) production processes (H, and H,O partial pressures), 
(2) the raw materials used, 
(3) finishing processes. 

This investigation was initiated to study the influence of these factors. For 
this purpose a method was developed for melting crystalline SiO; under con- 
trolled conditions. 


2. Experimental 


2.1. Equipment 


The equipment, as shown in fig. 1, has been developed to melt crystalline 
SiO, to vitreous silica under controlled conditions. The required melting tem- 
peratures were obtained by high-frequency heating of the molybdenum crucible. 
For reasons of temperature homogenization radiation shields were found to be 
necessary. The temperatures were measured by means of an optical pyrometer. 
Small holes (diam. 2 mm) were made in the radiation shields, so that the optical 


Optical path for 
temperature «-- 
measurements —- A050; single-crystal plate 

-—— Gas inlet from mixing 
apparatus 


Molybdenum heat shields 


O— Hf apparatus 
Molybdenum crucible 


Molybdenum standard 


t=, Gas outlet 


Fig. 1. Apparatus for melting vitreous-silica specimens. 
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path was interrupted by only a window of crystalline aluminium oxide and a 
vitreous-silica prism. We are aware that it is very difficult to determine the 
‘absolute temperature in this way; however, in the event it was possible to 
stabilize the temperature within 10°C. The melting-equipment is connected 
to an installation with facilities for controlling the composition of the gas 
mixtures. 

Small quartz crystals are melted in various gas mixtures at a pressure of one 
atmosphere. The controlled composition of the gas mixture is essential for 
obtaining a bubble-free vitreous silica within reasonable time. Melting quartz 
crystals enclose the surrounding atmosphere and form a bubble. The velocity 
at which a bubble rises is a function of the viscosity: 


$zr4og-—ó6szmnrv, 


where v = velocity (cm/s), 
Ae = density of the glass minus the density of the gas (2-2 g/cm’), 
g = acceleration due to gravity (cm/s?), 
7] = viscosity (poises), 


r = radius of the bubble (cm). 
In the temperature range where the melting processes occur (from 1800 to 
2000 °C) the viscosity is still very high (7 = 10° poises), so that the rising 
velocity of a bubble, size 100 microns, amounts to approx. 3.1075 cm/min. 
Consequently gases of poor diffusion and solubility (N,, Ar, CO;) remain in 
bubble for very long periods of time; H5, He and H,O, on the other hand, 
are absorbed rather quickly by the molten mass of glass. Hydrogen and water 
react with the molten glass and dissolve in the form of hydroxyl groups. The 
inert helium is physically soluble and has a relatively high diffusion coefficient. 


2.2. The melting procedures 


The empty Mo crucible after firing for 5 minutes at 1950 °C and cooling 
was filled with quartz powder. First the system was evacuated and then the 
selected gas mixture was allowed to enter slowly. This was repeated once. 
During heating up and melting a constant gas flow of 2 l/min was maintained. 
The high-frequency apparatus was switched on and switched off abruptly after 
the melting time had elapsed (quenching). The molybdenum crucible was then 
dissolved in hot concentrated aqua regia, so that a quartz-glass ingot remained ; 
this was polished bi-laterally, leaving a slide of approx. 3 mm, which was 
suitable for infrared-absorption measurements. 


2.3. Specifications and pre-treatment of the raw material 


Selected Brasilian rock quartz (nominal weight 4 g) was washed for 1 hour 
in a diluted HCI/HF solution (5% HF, 5% HCl). After acid-free rinsing and 
drying, the crystals were heated up to 800 °C and then dropped into cold water. 
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As a result of the ensuing thermal stresses the crystals break, and are thus easier 
to grind in a ball mill, containing vitreous-silica marbles. 

The coarse crystalline powder was washed in a hot (80 °C) 10% HCI/5 75 HF 
solution. Then acid-free rinsing and drying followed by a firing procedure 
(during 1 hour at 1000 °C in oxygen) took place. The particle-size distribution 
of the raw material obtained in this way is shown in table I. 

The concentrations of the impurities were determined by neutron-activation 
analysis. The result for the main impurities are given in table II. 


TABLE I 


Particle-size distribution of milled Brasilian-quartz crystals 


size (um) percentage 


TABLE II 


Main impurities in the raw material (SiO;) in parts per billion 


80 


Na 590 | 
Ge 1100 | 
Li 7000 *) 

3000 *) 


*) Atomic absorption spec. 


2.4. IR-transmission measurements 


The intensity of the absorption band at 2-73 microns was determined by 
means of a Unicam Sp 700 A Spectrophotometer. The absorption peak for 
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unassociated OH vibration occurs at about 2-75 um in most hydroxylic com- 
pounds. 

The fon value is defined as the optical density per mm, and is a measure 
of the relative hydroxyl concentrations in the samples. The absolute hydroxyl 
concentration can be calculated via the relation 


[on] = Bou Mon. 10 
Ey Og1 
where [OH] = hydroxyl concentration (g/g glass), 
Bou = optical density per mm (mm^), 
Ep = molar extinction coefficient (1 mole~+ cm~+), 
Mos = molecular weight of hydroxyl group (g mole~+), 
Og = density of the glass (g1~+). 


3. Results 


3.1. High-temperature reactions between hydrogen and molten SiO, 


As postulated above, a nearly bubble-free vitreous silica can be obtained by 
melting small quartz crystals in an H,/He atmosphere. This method can also 
be used to study the influence of the H, partial pressure on the hydroxyl con- 
centration in the vitreous silica. The samples are melted in an H;/He mixture 
in the way described above. The Boy value as a function of the hydrogen partial 
pressure is presented in fig. 2. In our opinion the increase of hydroxyl groups 
is possibly a result of the following reduction reaction: 


3 SiO, -H,zx2zSiOH + SiO. (1) 
liq. gas dissolved dissolved 


The hydroxyl groups and possibly also the silicon monoxide are incorporated 
in the glass structure. 


100 


& 
e 


Poy value. 10° (mmi!) 


0 
0:0 0:5 1-0 
——> Hp partial pressure (atm) 


Fig. 2. Variation of the Box value for specimens melted at different hydrogen pressures. 
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Bell, Hetherington and Jack ?:?) propose a different reaction mechanism, 
according to which trivalent silicon atoms are formed. According to this 
mechanism the samples should contain considerable quantities of paramagnetic 
Si?* ions. Up till now, however, we have not succeeded in detecting this 
paramagnetic ions by means of ESR measurements. 

Various tests have indicated that at the melting temperatures no equilibrium 
is reached between the reaction products in the vitreous silica and the sur- 
rounding gases. This is shown by the following experiments. 


Experiment I 


A sample was melted in a 99% He/1% H, atmosphere at 1950 °C. After 
15 minutes this flow was replaced by a H, flow for 60 minutes. No perceptible 
increase in the Boy value was established. 

During the 75 minutes the experiment lasted, a large part of the quartz glass 
had reacted. This phenomenon gave rise to a trial series, the evaporation rate 
being determined as a function of the hydrogen partial pressure (see fig. 3). 


E 
Š 


T =1900°C 
Prot Fi Py, =latm 


Evaporation rate 
(mg/min cm?) 


e 
Ò 


0-0* 
0-0 0-5 ro 
—— H; partiol pressure (atm) 


Fig. 3. Evaporation rate of vitreous silica in mg/min cm? as a function of the hydrogen 
partial pressure. 


The temperature dependence of the evaporation rate is shown in fig. 4. The 
conclusion may be drawn that the penetration of the hydrogen into the molten 
glass mass is probably prevented by a reaction occurring on the molten-glass 
surface, whereby the volatile reaction products are drained with the gas flow: 


SiO, + H, — SiOf + H,OF. (2) 


This reduction reaction has been studied by several investigators +°-15). It is 
likely that a similar reaction takes place on the surface of an included gas 
bubble. In this case, however, the reaction products cannot be drained, but 
-will dissolve in the quartz glass: 
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; 2000 1900 1800  1700(*C) 
10 


10° 


Evaporation rate (mg/min cm?) 


0-40 0-45 0:50 
— 5 I03/ T (K) 


Fig. 4. Temperature dependence of the evaporation rate of vitreous silica at different hydrogen 
pressures; X hydrogen: nitrogen = 1: 0, 

e hydrogen : nitrogen = 27 : 73, 

hydrogen : nitrogen — 0: 1. 


2 SiO, + SiO + H,O $S 2 =SiOH + SiO. (3) 
. gas gas diss. diss. 


The nett result of the two consecutive reactions (2) and (3) is the same as that 
of reaction (1). 


Experiment II 


A sample was melted in a 100% H, atmosphere (15 minutes at 1950 °C). In 
this case the H, flow was replaced by an He flow (60 minutes at 1950 °C). 
The fou value in the quartz glass did not perceptibly decrease in this procedure. 
Presumably the diffusion of hydrogen in vitreous silica follows a mechanism 
in which the physically dissolved hydrogen molecules (and not the hydroxyl 
groups) play an essential part +°). 

The concentration of the physically dissolved H; is determined by the equi- 
librium constant K of reaction (1). As during melting in an H, atmosphere 
no H, is supplied to the bulk (experiment I), a local equilibrium is reached 
in the bulk between the physically dissolved and the chemically bonded hydro- 
gen. There are two indications that the concentration of the physically dissolved 
H; will be lower than the saturation concentration at 1 atmosphere, viz. 

(a) During melting in an H, atmosphere the bubbles filled with hydrogen dis- 
appear completely. This shows that at this temperature the vitreous silica 
is not yet saturated with H3. 
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(b) Results to be published later 8) have shown that in the temperature range 
between 1200 and 1500 ?C an equilibrium can be reached between the sur- 
rounding atmosphere and the hydrogen in the quartz glass. In this tem- 
perature range the evaporation rate of the quartz glass has become small 
with respect to the diffusion rate of the H}. When these data are extra- 
polated to 1950 °C, the calculated foy value will be considerably higher 
than the experimentally found value. 

When the concentration of physically dissolved H, in the vitreous silica is 
small, its transport by diffusion will also be small; consequently the hydroxyl 
concentration during the experiment (60 minutes) will hardly decrease. 

The results of the above-mentioned experiments lead to the conclusion that 
in an early stage of the melting process a quantity of hydrogen is included, 
which will hardly change during the melting process. This would mean that 
the sintering properties and thus the particle-size distribution of the raw 
material must play an important role. At a certain stage of the sintering process 
(which precedes melting) a quantity of hydrogen is entrapped between the 
grains. In general, sintering properties are strongly influenced by the particle- 
size distribution of the raw material. To investigate this, the raw material was 
split up into five sieving fractions. These samples were melted in an H;/He mix- 
ture. The effect on the fo, value of the glass samples is demonstrated in 
table III. 


TABLE III 


Influence of particle-size distribution on the Boy value of vitreous silica, after 
melting in a 50%H,-50%He mixture 


Bou value of 
vitreous silica 


particle size (um) of 
the quartz powder 


300-600 
200-300 68-7 . 10-3 
160-200 | — 629.107? 
100-160 552. 10-5 

63-100 36-6 . 107? 


table-I distribution 


In view of this model, however, a linear dependence between the H, partial 
pressure and. the foy value could be expected. For the rather strong deviation 
which is found between the expected linear dependence and the experimental 
results (fig. 2), in our opinion, the following possible explanation can be given: 
(a) The sintering properties of the crystalline material are also determined by 

the sintering atmosphere. 


HYDROGEN IN VITREOUS SILICA 111 


(b) The raw material itself is a hydrogen source (though a constant one); see 
further sec. 3.4. 

(c) Within the finite quenching period of the sample, the reverse reaction can 
partly proceed while physically dissolved hydrogen is formed. This H, is 
not detected with the IR-transmission measurement. 


3.2. Hydrogen development and bubble formation during cooling 


On account of the temperature dependence of the equilibrium constant the 
equilibrium state of reaction (1) will be influenced during the cooling process. 

The equilibrium strongly shifts to the left, which appears from tests in which 
quartz glasses with a high fon value were fired under H, atmosphere at 1000 °C 
for 24 hours. The fon values of samples thus treated were decimated. A typical 
example is the following. 

A sample was melted in an H, atmosphere at 1950 ^C for 15 minutes. By 
cooling down the melt to 1700 ^C, maintaining it at this temperature for 
5 minutes and then quenching it, a bubble-rich quartz glass is obtained (fig. 5). 


Fig. 5. Bubble pattern of a rapidly and a slowly cooled vitreous-silica sample. Melting con- 
ditions: temperature: 1950 ^C, time: 15 minutes, pressure: | atm H3. 

(a) Melting procedure followed by rapid cooling. 

(b) Melting procedure followed by heating for 15 minutes at 1700 ^C and rapid cooling. 


A micro breaking device, included in the circuit of a gas chromatograph, was 
used to determine the hydrogen pressure in bubbles of different diameter (cf. 
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table IV). A shift of equilibrium during cooling causes physically dissolved 


hydrogen to be formed. According to Henry's law, 


P = c[H;] (physically dissolved), 


when the hydrogen concentration becomes large, the corresponding pressure 
will exceed atmospheric pressure and a tendency towards bubble formation 


will arise. 


Generally bubble growth in a liquid is described with two coupled differential 


equations !*): 
an ordinary differential equation 


RR, dR dm dR " 
dt? (S) xar) 


and the partial differential equation 
oc R? dR 0c d?e 
— + =D 
dt r? dt or 
where R = bubble radius, 
7] = viscosity, 
c = surface tension, 
o = density of the melt, 


2e AP T 
Ro 0 
2 0c 

+ B z) (5) 
r or 


AP = pressure differences (inside and outside the bubble), 


c = hydrogen concentration, 
r = distance from the centre of the bubble. 
TABLE IV 


Hydrogen pressure in bubbles of different sizes. The bubbles are grown at 


1700 °C at 1 atm H, pressure for 15 minutes 


bubble 
diameter 


(um) 


H, pressure (atm) in the 
bubble at 1700 °C (calculated 
from room-temperature 
pressure in the bubble) 
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In our case eq. (4) can be simplified to the extent of neglecting the first two 
inertia factors. This approximation is justified because of the fact that the 
viscosity of the liquid is very high (appr. 10? poises) and the bubble diameter 
is very small (10-100 ym). Equation (4) then changes into 


us + ad = AP (6) 
R R 
with the following solution: 
30) = id (1 — exp =) + aor. (7) 
Ro Ry AP 4n 45 


Table V gives the maximum size (calculated according to eq. (7)) which a bubble 
with an initial diameter of 10 microns can assume. The initial diameter of 
10 microns has been chosen because a quenched sample contains a small 
number of bubbles with a main diameter of 10 microns which will probably 
act as a nucleus. Comparison of the experimental values from table IV with the 
calculated values from table V warrants the conclusion that diffusion equation 
(5) plays an important role in bubble-growth kinetics. To solve the mathematical 
problem accurately would be very complicated because of some unknown 
factors. 


TABLE V 
Maximum bubble radius, calculated according to eq. (7), with: Ry — 5.107* 
cm, £ = 900 s, ø = 400 dynes/cm, 7; = 10? poises, and AP is assumed to be 
constant during growing 


AP (atm) | Rooo (um) remarks 
1-0 bubble should have shrunk 
1-63 5:0 no bubble growth 
2:0 9:4. 10 
3:0 2:0. 10° 
10-0 4-1. 1019 


3.3. High-temperature reactions between water and vitreous silica 


Water was introduced into the system by wetting the applied H,/He mixtures. 
With the existing equipment a maximum dew point of 25 °C can be reached. 
In this trial series the H,/He series ratio was kept constant (23 : 77) and the 
water partial pressure was varied between 0 and about 0-03 atm. 

The glass samples prepared in this way were fired in a vacuum atmosphere 
for several hours at 1000 °C. The hydroxyl content as a function of firing time 
of a sample treated in H;/He and in H,/H,O/He is given in fig. 6. The con- 


114 G. H. A. M. van der STEEN and E. PAPANIKOLAU 


100 


Temperature: 1000°C 


value . 10? (mm) 
OH 


“Stabilized” hydroxyl groups 


0 10 20 
— Firing time (h) 


Fig. 6. Bou value of vitreous-silica samples prepared by ourselves and fired in vacuum at 
1000 ?C. 


a: Thickness of the sample: 1-5 mm, melted in a dry 77%He/23 ?/;H; mixture. 


b: Thickness of the sample: 1:5 mm, melted in a wettened (dew point 22 °C) 77 “He/23 ;H 5 
mixture. 


[^7] 
5 S & 


— X fop value.10? (miri) 
e 


10.107? 20.10? 30.107? 
——— Fy oltm) 


Fig. 7. Boy value of vitreous-silica samples melted at different water-vapour pressures, after 


firing for 24 h (1000 °C) in vacuum. The hydrogen-helium ratio was kept constant during 
melting (23 : 77). 


centration of the remaining (“stabilized”) hydroxyl groups proves to be de- 


pendent on the applied water-vapour pressure (cf. fig. 7). Probably the fol- 
lowing reaction will take place: 


2 SiO, + H20 = 2 zSiOH. (8) 
liq. gas diss. 
The essential difference between the reactions (1) and (8) is that with reaction 


(1) not only are hydroxyl groups formed, but also a reduced centre. This 


reduced centre is oxidized at 1000 °C by the decomposing hydroxyl groups, 
while hydrogen is formed. 


The hydroxyl groups originating from the type-(8) reactions are not com- 
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pensated for by an equivalent number of reduced centres, and as a higher- 
valency situation of the silicon and/or a peroxide formation are highly im- 
probable, physically dissolved water will inevitably be formed during dis- 
sociation of the hydroxyl groups. As known from literature ?) the diffusion 
coefficient for water at 1000 ?C is about a factor 1000 smaller than the same 
coefficient for hydrogen. Probably this is the reason why the OH groups 
originating from a reaction with molecular water are more difficult to remove 
from the quartz glass. Experiments in which H, and H,O in the surrounding 
gases were replaced by D, and/or D5O, respectively, have shown that type-(1) 
and type-(8) hydroxyl groups are completely interchangeable. 


3.4. The raw material as hydroxyl source 


When the crystalline quartz powder is melted in an evacuated hydroxyl-free 
vitreous-silica ampulla, in an arrangement which bears much resemblance to 
the arrangement as shown in fig. 1, a bubble-free quartz glass is formed with 
a foy value of 20. 107? mm~+. These hydroxyl groups apparently belong to 
the category which is hard to remove (reaction (8)). After 24 hours of firing 
at 1000 °C in vacuum the Boy value has hardly changed (Boy = 19 . 107? mm^!) 

As during the melting process H, and H,O were excluded from the system, 
the hydroxyl groups must originate from the raw material. This was confirmed 
by using various raw materials, as a result of which the Boy value varied between 
10. 107? and 70. 107? mm-!. This gives rise to the question why the glasses 
that were prepared in a dry H,/He atmosphere contain no (or hardly any) 
“stabilized” hydroxyl groups. A probable explanation is the following. 

As long as the quartz crystals have not yet been completely sintered or melted 
a certain product of the already initiated reduction reaction can be drained 
relatively more quickly via the pores: 


sol. gas gas sol 


partly disappears via the pores (e.g. as a result of a temper- 
ature increase), 
partly stays behind the pores and later dissolves in the 
form of hydroxyl groups. 
Almost the entire SiO formed may stay behind in the bulk, e.g. as a condensate 
or included in the network. . 
According to this mechanism 


[oxidator (= OH)] < [reductor (= Si?*)]. 


The hydroxyl groups carried along by the raw material can now (partly) com- 
pensate for the shortage of oxidant. This model also explains the phenomenon 
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that the formation of "stabilized" hydroxyl groups (as a result of adding water 
to the gas) becomes perceptible only with Py, > 107? atm (fig. 7). 


3.5. Behaviour of the helium during the melting process 


When the He partial pressure was increased to above 0-5 atm, the melting 
times required to obtain a bubble-free quartz glass became increasingly longer. 
Under such conditions the vitreous silica is probably saturated with He. This 
finding gave rise to the trend of the bubble pattern being studied as a function 
of the melting time. Figure 8 shows the quartz-glass samples that were melted 
in a 99% He atmosphere. Attention is drawn to the fact that after some time 


Fig. 8. Bubble patterns of vitreous-silica samples. Melting conditions: temperature: 1950 °C, 
gas mixture: 99 %He/1%H, (1 atm). 

(a) Melting time 5 minutes, rapid cooling. 

(b) Melting time 10 minutes, rapid cooling. 

(c) Melting time 30 minutes, rapid cooling. 
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^ 


the smaller bubbles disappear, while the nominal diameter of the remaining 
bubbles increases. Finally also the larger bubbles disappear. 
The surface tension is the driving force for these shrinking and growing 


processes: ` 


20 
AP = —. 
R 


The gas in the small bubbles will have a higher absolute pressure than the gas 
in the large bubbles. The concentration of the physically dissolved helium 
around the bubbles adapts to the He pressure in the bubble (according to 
Henry’s law). These concentration gradients make gas transport possible. As 
the mutual distances between the bubbles are much smaller, on the average, 
than the distance between a bubble and the melting surface (or crucible wall), 
first of all there will be a mutual exchange of gas between the bubbles, as a 
result of which the smaller bubbles dissolve and the larger ones grow. At a 
later stage the larger bubbles also dissolve due to gas transport to surface and 
crucible wall. This process, however, is perceptibly slower on account of the 
relatively long diffusion paths and small pressure gradients. 


4. Conclusions 


(a) Important factors contributing to the hydroxyl content of vitreous silica 
are: 
(1) the hydrogen and water partial pressures of the surrounding gas during 
melting; 
(2) the particle-size distribution of the raw material; 
(3) the hydroxyl content of the raw material; 
(4) the heat treatment after melting. 


(b) There are probably two reasons why equilibrium between the melt and 
the surrounding hydrogen- and water-containing atmosphere is not reached: 
(1) A reaction at the glass surface SiO; + H, > SiOt + H5Of prevents the 
penetration of hydrogen into the molten glass mass. The removal rate of 
the reaction products was found to be linearly proportional to the hydrogen 
partial pressure. 

(2) The removal of hydrogen out of the melt is a very slow process, the con- 
centration of physically dissolved hydrogen being very low; therefore mass 
transport by diffusion is bound to be very small. 


(c) Because of the temperature dependence of the equilibrium constant K 
of the reaction 
3 SiO, + H; xx 2 zSiOH + SiO 
diss. diss. 


bubbles can be formed by hydrogen development during cooling. 
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(d) The apparent difference in stability of hydroxyl groups at 1000 °C should 
perhaps be ascribed to differences in oxidation state of the glass network. As 
long as oxidizable centres are available, hydrogen should evolve. When, how- 
ever, oxidizable centres are no longer available, water is likely to be formed. 
The removal rate of hydrogen and water from the vitreous silica is a function 
of the diffusion coefficient, the physical solubility and the geometry of the 
sample. 

The conversion of hydroxyl groups to oxygen bonds and H, (or H20) 
is hampered by the rate of removal of one of the reaction products. 

The rate of removal of physically dissolved water is relatively much slower 
than that of physically dissolved hydrogen. 


5. Final remarks 


(a) Recent experiments have shown that it is possible to determine the tem- 
perature dependence of the equilibrium reaction 


SiO, + H; = 2 =SiOH + SiO 
liq. gas diss. diss. 


in the temperature range 1200 to 1550 °C. 


(b) The first results of extraction experiments have shown that the hydrogen 
content of vitreous-silica samples does not agree with the value calculated 
from the 2-73-um absorption band. Use was made of the molar extinction 
coefficient (e, = 77.5 1 mole^! cm^!), which was determined by, among 
others, Stephenson and Jack ?). 


(c) According to the authors, complex formation between hydroxyl groups 
and negatively charged centres in the glass lattice or the formation of Si-H 
bonds, may explain the discrepancy between the results of the extraction ex- 
periments and the results calculated from IR-absorption measurements. 


The authors intend shortly to publish the results of these experiments. 
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THEORETICAL AND EXPERIMENTAL 
INVESTIGATIONS OF DIGITAL MAGNETIC 
RECORDING ON THIN MEDIA 


by D. L. A. TJADEN and E. J. TERCIC 


Abstract 


A large number of results obtained by means of a computer-simulation 
model of the digital magnetic recording and replay processes are 
reported. It is assumed that the recording medium is relatively thin and 
has simply describable magnetic properties. Results are compared with 
those of simple analytical theories recently proposed in the literature. 
Good agreement is found when computed results are compared with 
those obtained experimentally with the help of a flux-sensitive head. In 
the appendix some analytical expressions are given for demagnetization 
effects in a square-loop medium. 


1. Introduction 


Although the general problem of how to give an adequate mathematical 
description of the magnetic recording process does not seem to have been 
solved yet, the situation is considerably more satisfactory for the case of un- 
biased digital recording on media that have a large ratio of remanent induction 
4xM, to coercivity H.. If 4xM,/H,>> 1, as is the case for most continuous 
metal films (e.g. Co-P) proposed for recording purposes, demagnetization 
effects prevent the existence of an appreciable magnetization component per- 
pendicular to the recording layer and, in the analysis, restriction to longitudinal 
field and magnetization components seems a good approximation. Furthermore, 
the layer thickness is usually small compared to all other relevant dimensions 
which suggests the assumption of uniformity of field and magnetization over 
the thickness. In fact, the actual situation with layer thickness ? and magnetiza- 
tion M can be approximated by the limiting case where M — co and t — 0, 
the product Mt being kept constant. 

Within the limitations of the well-known Karlqvist approximation assuming 
the magnetic scalar potential to vary linearly across the gap, the field strength 
acting on the layer (see fig. 1) is then given by *) 


0-4 
A(x) = — — arctan 
g a a 


| ? dM (x^) 1 x—x 
+ 2t (- + — a) dx’, — (I) 


dx' x—x (x— x} + 4a? 


I x — x, + g/2 x—x,— g/2 
(usus 07 E: 
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Coordinates 
fixed to medium 


y | Recording medium 


E81 M(x), H(x) 


La 


Recording-head pole pieces 
Fig. 1. Configuration and coordinates. 


in which 

I = ampere turns across the gap, 

g = recording-gap length, 

x = distance along medium, 

x, = relative position of recording-gap centre, and 

a = average head-to-medium spacing. 

The coordinate system is fixed to the medium and x, is assumed to increase 
with time. 

The Karlqvist approximation holds well when g/a < 4 which we assume 
throughout this paper. For larger values of g or for more-complicated head 
geometries both terms in (1) would become considerably more intricate. 

Introduction of a suitable hysteresis model, defining M(x) in terms of the 
current value of H(x) together with (in principle) all past extremal values of 
H(x) with respect to time, completes the recording model. At any instance of 
the process, in which 
(i) J varies as a function of increasing Xm 
(ii) the recording head is removed from the medium, and 
(iii) the replay head is brought near the medium, 
eq. (1) should be satisfied for an M (x) and H(x) which remain in accordance 
with the hysteresis model. Discretization of the model, by considering M and 
H in an array of discrete points, replacement of the integral in (1) by a sum 
and introduction of finite sequential steps describing the successive changes of 
I and of the configuration, reduces the problem to that of repeatedly solving 
a system of, say, 50-200 non-linear equations. 

A number of results computed along these lines, indicated originally by 
Iwasaki and Suzuki?) and usually denoted as “self-consistent” recording 
models, have been published in recent years. Curland and Speliotis ?:^) and 
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Curland 5) gave results for single transitions on a previously saturated medium, 
of which the magnetization is locally reversed by application of the head field. 
This is one of the simpler cases to treat because, if the recording field saturates 
the medium, there is no need to take the head motion into account explicitly. 
The hysteresis, model they use is built up with straight lines and has a com- 
plicated minor-loop "strategy". Potter and Schmulian !) also considered cases 
where the first transition is followed by at least one more, thus making it 
necessary to consider what happens during head motion and increasing con- 
siderably the number of successive iteration series required. Furthermore they 
give results for finite-poletip recording and replay heads. Their hysteresis model 
is described in terms of hyperbolic-tangent functions. Tjaden 5) gave results 
referring to the validity of the “superposition principle" for two closely spaced 
transitions recorded on a previously a.c.-erased medium. His hysteresis model 
is extremely simple (fig. 2) involving four parameters: remanence Ma, coercivity 


Slope Xirr 


Slope Xrev 


T 


Fig. 2. Hysteresis model used in ref. 6 and in this paper. 


H,, reversible susceptibility y,.,, and loop susceptibility Xir; (the results presented 
were for 4,4, = 0, yi = oo). 

, Very recently a few other papers appeared on the subject. Chi and Speliotis 7) 
presented results (based on an extension of Curland's model) for a more-com- 
plicated pulse pattern, which they compared with experiment and with the out- 
come of superposition theory. Nishimoto et al.) improved the hyperbolic- 
tangent model of ref. 1 such as to avoid instabilities which appeared to occur 
when many minor loops are to be described. They computed results for density 
characteristics and for a certain worst-case pattern (... 11111011111... in NRZ-1). 
Finally, Dejouhanet and Lazzari?) published results for three-bit patterns 
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recorded by integrated heads. They chose a rather simplified approach in order 
to avoid the large number of iteration series required. In both refs 8 and 9 the 
image term (representing the effect of the presence of the head, corresponding . 
to the second term in the integral of eq. (1)) in the expression for the demagnet- 
izing field was neglected. 

References 1 and 5, in particular, have contributed considerably to the knowl- 
edge of what really happens during the magnetic recording process on thin 
metal films. On the other hand, due to the large number of parameters involved 
and the relatively long computing times which seemed necessary for this type 
of computation, the results have a specific rather than a general character and 
it seems difficult to *extrapolate" them to other situations than just those con- 
sidered. This is particularly true for cases involving more than one transition 
and holds even for the simplest case of an isolated transition. This may explain 
why, even recently, simple approximate theories have been proposed which treat 
this single-transition case by introducing severe constraints on theshape of M(x), 
used for calculating the demagnetizing field. If M (x) varies according to 


2 x 
M (x) = — Mg arctan — : (2) 
b c 


(the “arctangent-transition model” which has the advantage of much simpli- 
fying the theory of the replay process) then the demagnetizing field term in (1) 
' is given by 


x x 
H = 4M, t| —————- ——————— |, 3 
262 : ( 3:2 aoa) ©) 


and the problem is reduced to that of finding an appropriate value of the 
parameter c. 

The first theory of this kind was originated by Chapman t°). The most 
severe demagnetization conditions apply in free space (thus for a — oo in (3)) 
from which, according to Middleton ++), it follows for a square-loop material, 
by the argument that nowhere |Æ] > H,, that 


2 Mgt 
c x C, = 


(4) 
In more-recent papers by Williams and Comstock 17) and by Maller and 
Middleton +°) this result was extended to more-general loop shapes. In both 
these papers, demagnetization effects during recording were also treated by 
requiring that, immediately after the current reversal (in our notation), 


dM dH 


n © - 
EE UT dx (5) 
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at the centre of the transition. With the help of (1), (2) and (3) the parameter c 
of the arctangent transition could be determined. In both refs 12 and 13 the 
second (image) term of (3) was neglected. Depending on the reversible suscep- 
tibility x,.,, further changes occur in the central slope of the transition leading 
to final results for c (c = cz, say) which in ref. 12 are related to the free-space 
condition and in ref. 13 to the situation where the medium is completely shunted 
by the contact with the replay head. The larger of the two results c — c, and 
c = Cz is finally assumed to be the true one. 

Our computer program for self-consistent computations, as described in 
ref. 6, requires only very modest run times per case considered. It seemed 
therefore useful to apply it to a large number of cases and thus try to provide 
a general picture of the role of the various parameters. The results for isolated 
transitions are, furthermore, compared with those predicted by the "slope 
theory" of refs 12 and 13 which we slightly modified so as to incorporate the 
image effect of the recording head as well as that of the replay head (the second 
term of (3) is thus taken into account). In this way the basic assumptions under- 
lying the two approaches are the same and the only difference concerns the 
*arctangent-transition" constraint of the slope theory. 

When calculating the replay signal from the recorded magnetization we will 
assume that the head-to-medium spacing is the same for recording as it is for 
replay. Again using the Karlqvist approximation we have from the reciprocity 
theorem 


+ 


4tw pa x' J- g/2 
@(x) = — - (arctan 
a $8 


x' — g[2 
—arctan LM) M(x--xdx (6) 
a a 


-00 


for the replay flux in a unit-efficiency, one-winding, semi-infinite replay head 
(w — track width, x — location of gap centre, and here g represents the replay- 
gap length) If the head moves with velocity v, then the induced voltage is 
given by 
. e = —1073 v w Hx) (7) 


where 
A(x) = id — D(x 8 


with (x) given by (6). The quantity H,(x) (in oersted), which for the case 
g — 0 represents the induced field strength at the head surface, was chosen as - 
a natural measure for replay signals discussed in this paper. 

Of course, the differentiation of (8) can be performed under the integral sign 
of (6). Furthermore, all linear frequency-dependent effects in the replay head 
or in the replay channel following it which one might wish to consider can 
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most easily be taken into account by replacing the Karlqvist factor in (6) by 
the appropriate function of x'. For such purposes therefore no Fourier trans- 
forming of the computed replay signal is needed. A further discussion of these 
matters is, however, outside the scope of this paper. 

The nature of the equations governing the problem makes it possible to work 
with reduced quantities. Field strengths will be reduced by the coercivity H, 
and lengths by the head-to-medium spacing a. For the layer we introduce a 
“characteristic length” 


4a Mat 
H, 


p (9) 
which for saturation recording is the minimum transition length for a square- 
loop material (see the appendix). Instead of the absolute value of the recording- 
Bap ampere turns 7 we shall use 


04 |Z| arctan (g/2a) 
$m 
a H, g/2a 


which represents the reduced maximum head field exerted on the layer. 
Instead of the susceptibilities %,ev and x,,, we introduce r = H,y,,,/Mg and 
q = Mgr/ He Xir; thus a square-loop material has r = q = 0. 

Where single transitions are considered we shall assume that the recording- 
gap ampere turns switch from Ih | to —|z | at x, = 0. We shall characterize 
the replay pulse by the quantities PWs0/a, xo/a, Fso, and H,/H, where (see 


(10) 


fig. 3): 
PW; = pulse width at half height = x, — x,, 
Xo = peak location, S 
Fso = asymmetry factor (as defined in ref. 1) = (x; + x4 — 2X9)/(x2 — x1) 
and 
H, = pulse height. 


Fig. 3. Characterization of replay pulse. 
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We consider the case where g — 0 and where a single transition is recorded 
on a previously a.c.-erased square-loop medium as “basic” and this case we 
shall treat in detail in sec. 2. Next we discuss (as perturbations") the effects of: 
— pre-saturation of the medium (in sec. 2), 

— non-zero gap length (in sec. 2), 

— interaction between neighbouring transitions and violations of the super- 
. position principle (in sec. 3), and 

— the influence of the loop-squareness parameters r and q (in sec. 4). 

In sec. 5 we shall discuss a number of experimental results obtained with the 
help of a flux-sensitive head 14:15) on a thin Co-P layer and on a thin oxide 
layer. These results will be compared with those predicted from the self-con- 
sistent computations. 

In the appendix we give an exact (unconstrained) solution to the problem of 
free-space demagnetization of an originally linear transition recorded on a 
square-loop material, and discuss the consequences of this solution for the 
replay pulse shape. A similar result is also given for the density characteristic, 
thus forming an extension to a result published in an earlier paper *°). 


2. Single transition on square-loop material 


2.1. General 


As stated above, we first consider the case of a single transition where the 
loop is rectangular (r = g = 0). If furthermore g = 0 and the initial condi- 
tion of the medium is specified, the recorded magnetization and the replay 
signal can be expressed as functions of x/a, involving only two parameters: 
the reduced head field s and the ratio pja (= 4z.Mgt/ H, a) which is a measure 
for the dominance of demagnetization effects in saturation recording. If g = 0, 
the head field reduces to H, = 0:4 Ia[(x? + a?). If s< 1 no recording at all 
takes place. A substantially higher value of s may be necessary, however, in 
order to result in saturation recording, that is, in a transition in which the 
magnetization changes from -- Mg to —Mg. First we shall analyse the con- 
dition for this to happen. 


2.2. Non-saturation recording 


We shall, therefore, treat the non-saturation case first. If the medium was 
a.c.-erased prior to recording, we have the situation of fig. 4 which shows the 
magnetization distribution along the medium at various stages of the process, 
for pja = 10 and s = 2. Immediately prior to switching, the magnetization 
varies according to curve (1), between a value My < Mg and zero. Imme- 
diately after switching we have curve (2), while curve (3) shows the situation 
when the recording gap has moved far to the right. A number of intermediate 
stages are shown also, for x,/a — 2, 4, 6, 8, and.10. The recorded magnetiza- 
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——— X/a 
Fig. 4. Various stages of the recording of a single transition with a write field too small to 
saturate the medium. Drawn curves: (1) prior to switching, (2) immediately after switching, 


(3) head moved far to the right. Dash-dotted curves show intermediate stages. Dotted curve: 
effect of free-space demagnetization. Dashed line: asymptotic value — Mo/Mg. 


tion thus varies between -- M, and the asymptotic value —Mp. The effect of 
free-space demagnetization (removal of the head) is very small in this particiar 
case, as indicated by the dotted curve. 

As long as Mo < Mp, the result does not depend on Mpg any more; thus, 
on dimensional grounds, it is easy to see that the resulting magnetization M (x) 
is expressible in the form 


4n Mt 
H.a 


= f Gia, s) (11) 


when p > po where po = 4%Mot/H,. The relation between po/a and s is de- 
picted in fig. 5. This curve thus gives the minimum value of s (so(p/a), say) 
necessary to reach saturation for a given value of p/a. 

If the medium was pre-saturated prior to recording, the situation becomes 
different. If M = Mg originally (positive pre-saturation), the transition goes 
from Mg to Mr — Mo, Mo being the same as in the previous analysis. This 
can be understood from the fact that the difference which can be brought 
about by the head field between stationary magnetization levels at the two 
sides of the gap region is determined by demagnetization conditions. Similarly, 
if M = —Mmg originally (negative pre-saturation) the transition goes from 
—M,+ M, to —Mg. The condition for saturation recording on a pre- 
saturated medium is obviously s > so(2p/a). 

Pulse shapes are rather different for these cases. Table I gives a number of 
computed values of PWso/a, Fso, Xo/a, and É,/H,, together with the value 
which p/a is assumed to exceed for this analysis to be valid, for different values 
of s and for different initial conditions. In all cases, except when M = —Mg 
initially, the pulses are extremely asymmetric. 
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02; 15 2 25 


—— S 


Fig. 5. Reduced maximum obtainable magnetization po/a (= 4zMpgt/H,a) versus reduced 
write field s. 


TABLE I 
- Single-pulse characteristics for transitions recorded below saturation 
pla 


should 
exceed: 


initial 
state 


virgin 
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1 


2.3. Saturation recording 


If we have s > so(p/a) (or s > so(2p/a) for a pre-saturated layer), the 
magnetization in the recorded transition varies between +M, and —Mg. We 
computed the transitions for a range of values of s and p/a. The latter parameter 
was varied between 0:5 and 20 and it appears that this range covers not only 
the cases of practical interest but is also the most interesting from a theoretical 
point of view. At p/a < 0-5 the replay pulse shape is dominated by the effect . 
of replay-head spacing and at p/a > 20 the recording process itself may be 
considered ideal *) and the results can be adequately described by the theory 
given in the appendix. 

In this case a transition of finite length is formed, ranging between values of 
x/a shown in fig. 6 as a function of s, for pja = 0-5, 1, 2, 5, and 10. The medium 


Beginning End of transition 
O_o OOO 
10 5 210505125 10 -«— p/a 


~10 -5 0 5 10 


—— x/a 
Fig. 6. Boundaries of transition region as a function of reduced write field 5, for various 
values of p/a. 


was assumed to be a.c.-erased originally. With increasing s the transition length 
shows a minimum (aL, say) which is reached at a value of s which increases 
with p/a. If we denote the corresponding minimum transition length before 
removal of the recording head by aL’ then it appears that L = L’ for p/a < 2, 


*) This becomes different when r > 0; see sec. 4. 
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Fig. 7. Reduced minimum transition lengths L’ and L vs pla before and after free-space 
demagnetization respectively. 


whereas L æ pja for pja > 5. This is shown in fig. 7 which depicts L and L’ 
versus p/a. It appears that L' ~ 1-7 (p/a)®°® in this range. 

Figures 8 and 9 show respectively the reduced pulse width and pulse height 
as functions of s, with p/a as the parameter. In these pictures the non-saturation 
case of the previous section is included. Figure 10 shows, for p/a — 1 and for 
pla — 10, the effect of pre-saturation of the medium on the reduced pulse 
width. 


Fig. 8. Reduced pulse width vs reduced write field for a.c.-erased medium. The parameter 
is p/a. 
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1 2 5 10 
— > S 


Fig. 9. Reduced pulse height vs reduced write field for a.c.-erased medium. The parameter 
is p/a. 


d 2 5 10 


Fig. 10. Reduced pulse witdh vs reduced write field for p/a = 1 and for p/a = 10. Medium 
positively pre-saturated (1), a.c.-erased (2), and negatively pre-saturated (3). 
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In fig. 11 the minimum value of PWi;o/a with respect to s is shown as a 
function of p/a, compared with the corresponding results according to: 
(1) arctangent transition consistent with free-space demagnetization (from (4) 
one has PW;o/a = 2 + p/za), 
(2) the “slope theory” discussed in sec. 1 (here the minimum pulse width al- 
ways occurs at s = 2 when the head field gradient is at its maximum), and 
(3) free-space demagnetization of ideal (step-like) transition (see the appendix). 


02 05 7 2 § 10 20 50 100 


Fig. 11. Minimum reduced pulse width PW 56/a vs pja (drawn curve). Dashed curves show 
results of other theories (see text): (1) arctangent transition, (2) slope theory. The dash- 
dotted curve (3) shows the result of free-space demagnetization of a step-like transition. 


Figure 12 shows the corresponding values for the reduced pulse height for these 
cases. A number of typical results are also given in table II, including values 
found for the pulse asymmetry F5, and the pulse location xo/a. 


02 05 1 2 5 10 20 50 100 
——»- p/a 


Fig. 12. Reduced pulse height ÉL H, vs pla for cases as in fig. 11. 
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TABLE II 


Single-pulse characteristics for various cases (square-loop medium) and cor- 
responding results from "slope theory" 


—0-005 


“slope theory” 


virgin 
0-221 


” 


1 » 0-411 
1 Mg 0-411 
1 —Mng 0-411 
1 virgin 0-409 
1 +Mp 0-409 
1 —M, 0-409 
1 virgin 0-395 
2 | 2 0-721 
2 5 0-688 
5 2 1:47 
5 3 1-39 
10 | 3 2:41 
10 |3 +Mep 2:41 
10 3 —Mn 2:41 
10 5 virgin 2:31 
10 5 T Mg 2:31 
10 5 —Mng 2:31 
10 8 virgin 2:19 
20 |3 8:37 *)| 3-04 *) 
20 5 8-37 *)| 3-04 *) 
20 8 8-37 *)| 3-04 *) 


*) Values following from eq. (4). 


When we compare the results of the various theories it appears that in certain 
regions for p/a remarkably similar values are predicted for either pulse width 
or pulse height or both. In other regions large discrepancies occur, especially 
for the pulse height. The "slope theory" appears to deliver results which in the 
region pja ~ 1 are much too pessimistic. The reason for this rather unex- 
pected failure appeared to be the assumption of an arctangent transition shape 
directly after switching. The value of dHp/dx needed in (5) is very sensitive 
to what at first sight do not seem very important modifications of the assumed 
transition shape. This becomes clear if we compare the free-space values of 
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(d.Hp/dx),..9 for a linear transition, an arctangent transition, and a quadratic 
transition +°), all having the same slope y~* at the origin (see table IIT). 


TABLE III 


Slope at origin of the free-space demagnetizing field for a few transition models 


1 (5) 
Mgt dx x=0 


transition model M (x)/Mg 


linear: 
x, x| < 
ly (X «» -Aii 
sign(x) — (|x| > y) 
arctangent: 
2 mux 
— arctan — —n?[y? 
m 2y 
quadratic: 
1 -Bj (|x| < 29) 
4y 
—o0 


(xp > 2y) 


For small values of p/a the replay spacing effect is dominant and we find 
approximately PW.,/a + 2 and EH, (2/2) pla, as should be the case 
without demagnetization effects. For large values of p/a (> 10) the free-space 
demagnetization effect is dominant, making it relatively unimportant again 
what happened in detail during recording. Here we have approximately 
PW;o/a e 2 (p[a)? and EH, ~ (4/2) In (p/a) (see the appendix; the latter 
result follows from eq. (29)). 

As pointed out earlier by Curland 5), both the dominance of replay-head 
separation (small p/a) and the dominance of free-space demagnetization (large 
p/a) tend to symmetrize the pulse shape. Furthermore, as also found by Potter 
and Schmulian +), the pulse asymmetry changes from positive to negative values 
with increasing recording current. 


2.4. Non-zero gap length 


As stated before, our use of the Karlqvist approximation limits the validity 
of our approach to the range of relatively small values of g, g/a « 4, say. 
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Within that range we performed a number of computations to investigate the 
influence of gap length. We assumed write gap (g,) and read gap (g,) to be 
equal (g), but — in order to get an impression of the relative importance of 
the two — we repeated, in every case, the replay part of the computation 
assuming g, = 0. We restricted this analysis to the cases p/a = 1 and p/a = 10 
for an a.c.-erased layer. 

One. of the effects of increasing g,,/a, for a given value of s, is a broadening 
of the maximum of the head field vs x/a and, therefore, saturation of the 
medium becomes easier. This is demonstrated by the curves of fig. 13 which 


Fig. 13. Influence of gap length. Reduced pulse width vs reduced write field for p/a — 1 
and p/a = 10, and for (a): gy = g, = 3a, (b): gy = 3a, g, = 0, and (c): g, = g, = 0. 


show, for p/a = 1 and for p/a = 10, the dependence of PW59/a on s for the 
three cases (a): gy = g, = 3a, (b): g, = 3a, g, = 0, and (c): g, = g, = 0. 
Comparison of these curves indicates that the resolution obtainable in the 
recording process is practically unaffected by the increase of g,, and that the 
replay effect is pre-dominant. It appears that the effect of the replay gap on 
pulse width can be well described (but is slightly overestimated) by the formula 


PWso,=9) = [(PWsoq, 20)? + g?]? (12) 


which should hold exactly for arctangent transitions ++). This can be seen from 
the data in table IV which, for a few cases, gives additional information on 
pulse height, pulse location, and pulse asymmetry. Note that the pulse location 
is measured relative to the recording gap centre at the instant of switching. 
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TABLE IV 


Effect of variation of write and/or replay gap lengths 


3. Interaction of transitions 


3.1. General 


So far we have considered isolated transitions only. If the "superposition 
principle" were strictly true, detailed knowledge about single-transition pulse 
shapes would be sufficient to predict the replay signal for general bit patterns. 
Unfortunately, the rather stringent conditions under which superposition can 
be assumed to hold well$) are met only in those practical situations where 
either: ; 

(a) the magnetization transitions are far enough apart not to interact appreciably 
(either during recording or after) and any possible overlapping of the replay 
pulses is due to the linear replay process, or 

(b) irreversible demagnetization effects are negligible and the partially over- 
lapping transitions are not too close, this condition depending on the shape 
of the Preisach distribution function of the medium *). 

The dominance of demagnetization effects, on the other hand, makes applica- 
tion of the superposition principle at least questionable. Transitions already 


*) It must be remarked that the validity of the Preisach model for continuous metal recording 
layers seems never to have been investigated. 
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written affect, by their contribution to the demagnetizing field, the way in which 
a transition is recorded even if they are sufficiently far away to prevent them 
from being partially overwritten. Furthermore, the effect -of free-space de- 
magnetization depends on the recorded transition pattern in a non-linear way 
which can be easily demonstrated to be inconsistent with the superposition 
principle. Tjaden 5) gave a few examples of magnetization distributions for a 
two-pulse pattern; in the following we shall extend these results to include some 
more-complicated situations with an emphasis on the shape of the replay signal. 


3.2. Two transitions 


We computed the replay signal for an isolated pair of transitions at various 
spacings for the cases p/a = 1, s = 2-5 and p/a = 10, s = 5. The replay signal 
is best characterized by the heights (H, and H;) and positions (x, and x;) of 
its two peaks (see fig. 14) where the latter must be compared to the peak posi- 


Fig. 14. Characterization of two-pulse signal. 


tions (x,' and x,’) that single transitions written at the same head positions 
would have shown. We present as the peak shift the differences x,' — x, and 
X3 — X5', both of them reduced by the head-to-medium spacing a but not, as 
usual, expressed as a percentage of the nominal bit separation b. Results are 
shown in figs 15 and 16. The dashed curves show the similar results obtained 
by superposition of single pulses. It appeared that H, and H, were not much 
different and the curves represent their average value. The peak shifts, however, 
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Superposition 
~ 


~ 


Fig. 15. Reduced peak shifts and reduced signal amplitude for two pulses at reduced nominal 
separation b/a, for the case p/a = 1, s = 2:5. Dashed curves show similar results obtained 
by superposition. 


show a large asymmetry which is contrary to what follows from superposition. 
In the first case (p/a — 1, fig. 15) superposition appears to predict fairly well 
the peak height as well as the average peak shift. In the second case (p/a — 10, 
fig. 16) the agreement is poor, particularly for the peak shift. 


-——— 


L^" Superposition 


E y 
T ~~ Superposi tion 
M x 


0 
10 12:5 


Fig. 16. Reduced peak shifts and reduced signal amplitude for two pulses at reduced nominal 
separation b/a, for the case p/a = 10, s = 5. Dashed curves show similar results obtained by 
superposition. 


3.3. Three transitions 


A possible breakdown of the superposition principle is demonstrated very 
clearly when we consider an isolated group of three transitions at equal nominal 
spacing b. Resulting magnetization distributions for such a case (p/a = 10, 


INVESTIGATIONS OF DIGITAL MAGNETIC RECORDING ON THIN MEDIA 139 


"ZEN 


-02 
-04 
-0 
-08 
Lo -5 0 10 15 20 


——— — xa 


Fig. 17. Resulting magnetization distributions after recording of three equally spaced pulses 
(p/a = 10, s = 5, various values of b/a). 
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Fig. 18. Replay signal for three equally spaced pulses with p/a = 1, s = 2:5, and various 
values of b/a. Dashed curves show results of superposition. 
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— x/a 


Fig. 19. Replay signal for three equally spaced pulses with p/a = 10, s = 5, and various 
values of b/a (similar to fig. 17). Dashed curves show results of superposition. 


s = 5, various values of b/a) are shown in fig. 17. The second transition is not 
centred between the two outer ones but is shifted towards the first one, resulting 
in a typically asymmetrical replay signal. Superposition would predict a sym- 
metrical signal if the isolated pulse is symmetrical. For various values of b/a 
these replay signals, together with those obtained by superposition, are shown 
in fig. 18 (p/a = 1, s = 2:5) and fig. 19 (p/a = 10, s = 5; similar to the case 
of fig. 17). Note that variation of b/a affects the height of the third pulse only 
slightly. 


3.4. Density characteristic 


The replay signal for a periodical sequence of transitions at equal spacings is, 
in the literature, the subject of many experimental and theoretical investiga- 
tions. The relation between peak amplitude and bit spacing is generally referred 
to as the “density characteristic”. Its decay at close bit spacing is often con- 
sidered a measure of a system’s resolution. A theoretical maximum, for a 
square-loop material, is given in the appendix. Superposition of HH, = 
+ (1 + x?/C?)*! pulses at spacing b leads with some simple calculations to 


the expression 


» x zc zc 
Hb) = Hoo) EE cosech 2 (13) 


INVESTIGATIONS OF DIGITAL MAGNETIC RECORDING ON THIN MEDIA 141 


for the density characteristic, in which A,(co) represents the single-pulse height 
H,. By considering signals consisting of a relatively small (6-12) number of 
transitions it proved possible to compute the replay signal for various points 
of the density characteristic with sufficient accuracy. With q = r = 0, g = 0, 
this was done for the two cases p = 1, s = 2-5 and p = 10, s = 5. Figures 20 


1 


oal Superposition of 
Fis(b/8) cornputed pulses 
Hg (co) 


E 


0-4 Superposition 
d acc. to eg. (13) 


0-2 


% 05 1 15 2 25 
——v»- b/a 
Fig. 20. Computed density characteristic for p/a = 1, s = 2-5 (drawn curve). Dashed curves: 
results of superposition. Dash-dotted curve: maximum for ideal recording process. 
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Fig. 21. Computed density characteristic for p/a — 10, s — 5 (drawn curve). Dashed curves: 
results of superposition. Dash-dotted curve: maximum for ideal recording process. 
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and 21 show the computed normalized amplitude A,(b/a)/H,(co) vs the reduced 
bit spacing b/a for these cases, together with the result of superposition of com- 
puted pulses, the result according to (13) (taking C = 4 PW;o), and the 
theoretical maximum according to eq. (30) of the appendix. 
Furthermore, it appeared that, at higher densities and particularly for the 
case’ p/a = 10, a considerable phase shift occurs between the computed and 
superposed signals, as shown in fig. 22. The peaks of the computed signal, 


2 


p/a 


— b/a 


Fig. 22. Reduced distance o/a by which peaks of computed density-characteristic signal are 
advanced with respect to those obtained by superposition, for p/a — 1, s — 2:5, and for 
pila = 10, 5 = 5. 


expressed in terms of x/a, appear shifted to the left (thus advanced in time) 
over a distance o/a which increases with increasing b/a. This shift is a con- 
sequence of the effect of the demagnetizing field from the pattern already 
recorded. When recording a single transition, the magnetized region on its left 
produces a contribution to the demagnetizing field which counteracts the head 
field. As a consequence the transition is recorded closer to the gap. 

It is not unlikely that this effect can be partly compensated by introducing 
appropriate corrections, derived from the previously recorded signal, to the 
recording current and/or its switching moment. 


4, Consequences of loop squareness 


Introduction into the analysis of the two loop-squareness parameters g and r 
makes it necessary to impose some limitations on the values considered for the 
other parameters. Throughout this section we shall assume g = 0 and consider 
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only the cases p/a — 10 and p/a — 1. Also we shall restrict ourselves mainly 
to the single-pulse case; a few examples for two interacting transitions will, 
however, be given. 

At first thought one would expect non-zero values of q and r to give rise to 
broader and lower pulses than found for a square-loop material. Such a 
behaviour is also predicted by the slope theory, as we shall see. The effect of 
a non-zero reversible susceptibility, however, may be that free-space demagnet- 
ization is partially or completely reversible and thus much less harmful (such 
an effect was also found in the analysis of free-space demagnetization in the 
model of Maller and Middleton +°)). In these cases, particularly, omission of 
the image term in (1) leads to fundamentally wrong results. 

The first cases we considered concern the dependence of pulse width and 
pulse height on recording current. This we did for g = 0, r = 0:4, forq = 0-4, 
r = 0, and for q =r = 0-4. These results are shown in fig. 23 (PW;o/a vs s) 
and in fig. 24 (/7,/ H, vs s), together with the corresponding curves for q = r = 0 
from figs 8 and 9. As it appears that the value of s for which the pulse width 
is a minimum does not change much with g and r we decided to restrict our 
attention further to the cases s = 5 for p/a = 10 and s = 3 for pla— 1. ` 

Results for p/a = 10, s = 5 are shown in figs 25 and 26, for q and r ranging 
between 0 and 0-6. The case q = r = 0-6 was excluded because in this case 
no saturation was reached with s = 5. Apart from the computed values of 
PWio/a and H,/H, the figures show the corresponding results from the slope 
theory which, as it appears, much overestimates the effect of increasing r. Also 
shown are the results which would follow for an "ideal" recording process, 
obtained by applying the free-space demagnetization and replay stages of the 
self-consistent computation to an originally step-like transition (this is the 


Fig. 23. Reduced pulse width vs reduced write field for p/a — 1 and p/a = 10, and for 
various values of the loop-squareness parameters g and r. (a): (q, r) = (0, 0); (b: (q,r) = 
(0-4, 0); (c): (q, r) = (0, 0-4); (d): (g, r) = (0-4, 0-4). 
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Fig. 24. Reduced pulse height vs reduced write field for cases as in fig. 23. 


0 OF 02 03 04 05 06 07 
Pf 
Fig. 25. Reduced pulse width PW 5ọ/a vs reduced reversible susceptibility r for p/a = 10, 
s = 5, and for various values of the loop-squareness parameter g (drawn curves). For com- 
parison: result from slope theory (dash-dotted curves) and result for ideal recording process 
(dashed curves). 
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Fig. 26. Reduced pulse height A,/H, vs r for cases as in fig. 25. 
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Fig. 27. Results as in fig. 25 but for pja = 1, s = 3. 
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Fig. 28. Results as in fig. 26 but for pja = 1, s = 3. 


TABLE V 


Effect of loop-squareness parameters on single-pulse characteristics 
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situation treated analytically in the appendix for a square-loop medium *)). 
For these latter curves, the value of s is irrelevant, of course. They clearly 
show the effect mentioned above of pulse sharpening with increasing r due to 
free-space demagnetization effects becoming more and more reversible. It ap- 
pears however that, contrary to the situation when g = r = 0, the assumption 
of an ideal recording process leads to wrong results when q and, especially, r 
take on larger values. The corresponding results for p/a — 1, s — 3 are shown 
in figs 27 and 28. In table V some figures are given, including values of Fs 
and xo/a. It appears that larger values of r lead to less symmetrical pulses when 
pla = 10. This is explained by the fact that irreversible free-space demagnetiza- 
tion tends to symmetrize the pulse, as we saw earlier. 

Finally we considered the effect of varying q and r on the characteristics of 
a two-bit signal. Some of these results, for the quantities defined in fig. 14, are 
presented in table VI (all for p/a — 10, s — 5). Figures between parentheses 
give the corresponding results obtained by superposition. It appears that with 
increasing q the peak heights obtained by superposition come into better 
agreement. The peak shifts, especially that of the first peak, are always pre- 
dicted very badly by the superposition principle. ^ 


TABLE VI 


Characteristics of two-bit signal for p/a = 10, s = 5, and various values of q 
and r. Figures between parentheses obtained by superposition 


O | 125 (1-65) | 1:34 (1-65) | 1-79 (0-66) | 0:47 (0-62) 
06 | 0-85 (1-35) | 0-82 (1-44) | 1-56 (1-01) | 0:05 (0-75) 
O | 1:37 (1-43) | 1-46 (1-44) | 1-83 (0:53) | 0-66 (0-50) 
O | 138 (1-28) | 1-51 (1-29) | 1-85 (0-44) | 0-79 (0-42) 
O | 1-33 (117) | 1-49 (1-18) | 1-87 (0-40) | 0-81 (0-35) 
0 | 1-66 (2:62) | 1:82 (2:60) | 1:33 (016) | 0:73 (0-16) 
0 | 1-74 (232) | 1-83 (232) | 137 (0-26) | 0-76 (0:26) 
O | 168 (192) | 1-79 (1-93) | 1:37 (0:24) | 0-77 (0-24) 
O | 1-59 (1-68) | 1-72 (1-68) | 1:34 (0-16) | 0-75 (0-15) 


*) As recently shown by Van Herk and Wesseling +7) the limiting case q = 1, r = 0 is 
also analytically tractable. 
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5. Comparison with experiments 


The adequacy of the simple hysteresis model used throughout this paper was 
tested by applying the computer program to some experimental results obtained 
by means of a flux-sensitive head !^) on a thin Co-P layer and on a thin oxide 
layer. Some of the experimental results and details about the way in which they 
were obtained have been published in an earlier paper by Tercic +°). The flux- 
sensitive head has à 2 um gap, 50 turns, and a recording efficiency of 90%, 
thus 1 mA through its winding results in 0-045 ampere turns across the gap. 

For the computation of demagnetizing fields we did not use eq. (1) of this 
paper but eq. (4) of ref. 6, in which the effect of non-zero layer thickness is 
accounted for in a first approximation. 

The Co-P layer has 4%Mp = 10000 G, H, = 241 Oe, and t = 0-15 um. Its 
hysteresis loop, with some minor loops, is depicted in fig. 29. This loop was 


Fig. 29. Hysteresis loop with minor loops of Co-P layer used for experiments. Dashed lines ' 
show loop assumed for computations. 


approximated by the dashed one indicated in the figure, using q = 0-1 and 
r = 0:2. The Co-P layer has a chromium top layer of 1-2 wm. Assuming a 
residual head-to-medium spacing of 0-3 um we used a value of a as defined 
in this paper: a = 0-3 + 1-2 + 0-15/2 = 1-575 um. In all experiments the 
material was a.c.-erased prior to recording. l 

Computed and measured single-pulse shapes and positions are compared in 
fig. 30, for various recording currents. It may be concluded that, even for cur- 
rents appreciably below the value required to reach saturation, the agreement 
is still reasonable. 

The maximum pulse height occurs at a recording current of about 8 mA and ^ 
a number of experiments and computations were done for this current involving 
the interaction of two or more pulses. 
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Fig. 30. Measured (dashed curves) and computed (drawn curves) single-pulse shapes for 
Co-P layer. 
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Fig. 31. Pulse height HY, vs nominal bit separation b for two pulses recorded on Co-P layer. 
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Fig. 32. Peak shifts x,’ — x, and x2 — x2’ vs nominal bit separation b for two pulses recorded 
on Co-P layer. 


Figures 31 and 32 show results obtained for two pulses at various distances. 
Figure 31 depicts the average height of the two pulses. For comparison, the 
results of superposition both for the measured and computed pulses are also 
shown. The experimental points do not lie exactly on the computed curve but 
the observed deviation from the result of superposition is well predicted by the 
model. The peak shift, depicted in fig. 32 in a manner similar to figs 15 and 16, 
shows a large asymmetry, confirmed by the computation but not following 
from superposition. 

Computed and measured replay signals for three equally spaced transitions 
are shown in fig. 33 (b = 4 um) and fig. 34 (b = 3 um), together again with 
(dashed) the results obtained by superposition. For b = 3 ym in particular the 
agreement is not perfect but the predicted asymmetry of the signal is found 
experimentally even to a somewhat larger degree. The pictures show clearly 
the invalidity of the superposition principle for cases like those considered. 

We also measured and computed a few points of the density characteristic, 
the results of which are summarized in table VII. Apart from the amplitudes 
(measured and computed, together with results of superposition) we also give 
(denoted by xo) the positions of the observed signal peaks relative to the cor- 
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Fig. 33. Computed and measured three-bit replay signals recorded on Co-P layer for b — 
4 um. Dashed curves obtained by superposition. 
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Fig. 34. Computed and measured three-bit replay signals recorded on Co-P layer for b = 
3 um. Dashed curves obtained by superposition. 
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TABLE VII 


Experimental and computed results for some points of the density characteristic 
of Co-P layer 


Ê, (Oe) measured 


H, (Oe) computed 194 
Ê, (Oe) measured (superposition) 245 


Ê, (Oe) computed (superposition) 


Xo (um) measured 
Xo (um) computed 


responding positions of the recording-gap centre at the instant of current 
switching. As we already saw in sec. 3 for a square-loop material, the model 
predicts that this distance will increase with increasing density. The measured 
results show, at least qualitatively, the predicted tendency. 

For the iron-oxide coating we only compared the observed single-pulse 
shape with the computed one, for 8 mA signal current. This material has 
4nMp = 650 G, H, = 286 Oe, and t = 1:2 um. The loop shape could be 
approximated roughly by taking q = 0-7 and r = 0-2. We took a= 0:9 um, 
thus assuming a head-to-medium spacing of 0-3 um. When computing the 
replay signal an obvious integration over the coating thickness was incor- 
porated. The result (fig. 35) shows that height, width, and position of the pulse 
are fairly well predicted. The observed pulse shows, however, a relatively large 
asymmetry which is not predicted by the model and which must be ascribed 


-20 “15 -0 -5 0 5 10 15 20 
—— x (um) 


Fig. 35. Measured (dashed curve) and computed (drawn curve) single-pulse shape for thin 
iron-oxide layer. : 
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to the effect of a perpendicular magnetization component. Attempts have been 
made (see e.g. ref. 9) to account for perpendicular magnetization in self-con- 
sistent models by treating it, as affected by the perpendicular field, in a similar 
way to (and independent of) longitudinal field and magnetization components. 
In our opinion, a rather more sophisticated hysteresis model would be required 
for oxide coatings, taking both components into account simultaneously. 


6. Conclusions and final remarks 


It appears that simplifications of digital-recording theory based on the 
assumptions of arctangent transitions and the superposition principle lead in 
many cases to results which are significantly different from those obtained with 
the more exact assumptions underlying “self-consistent” recording models. 
The use of the arctangent transition in the way it is done in the slope theory is 
fundamentally inadmissible. Furthermore, from the results of sec. 5, it appears 
that even the extremely simple hysteresis model we use is well able to explain 
the observed recording behaviour of thin metal recording layers. 

An obvious question is in how far the results of this paper, at least qualitative- 
ly, could be applicable to thin (1 um, say) oxide layers, in situations where 
the assumption of uniformity of field and magnetization across the layer seems 
still reasonable. We already mentioned, in the previous section, the complica- 
tions arising from the occurrence of perpendicular field and magnetization com- 
ponents; a few more remarks can, however, be made. The first of these remarks 
concerns the thin-layer approximation used throughout this paper. Demagnet- 
ization effects, for a given value of p, become less severe if no longer 4x M, R> H, 
(or, which is the same, £ <p). In order to get an impression of the relative 
importance of this effect we computed the magnetization distribution and the 
replay signal for a single pulse recorded on a square-loop medium having 
Az: Ma = 1000 G, H, = 250 Oe, and t = 1 um (thus p = 4 pm) for a = 1 um, 
g — 2 um, and s = 4. We found a transition length of 3-4 um and a pulse 
width of 3-6 um. When assuming 4xM4 = 10000 G and t = 0-1 um with the 
other parameters unchanged we found a transition length of 4-0 um and a 
pulse width of 3-9 um. It thus seems that the thin-layer approximation, when 
applied to a case where 4x: M&/H, is in the order of magnitude common for 
oxide layers, gives rise to an error which, although not negligible, is not very 
large. Our second remark concerns the loop shape. Oxide layers, in particular 
those containing unaligned uniaxial particles, have often much less-square loops 
than continuous metal recording layers. This makes it not only more difficult 
to fit them to our hysteresis model but it also implies that deviations from the 
superposition principle must more readily be expected as predicted by Tjaden 9) 
in an analysis based on the Preisach model and neglecting demagnetization 
effects. It is possible that these latter deviations partly compensate those result- 
ing from demagnetization effects. The better agreement of superposition for 
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larger values of q (only for the yu heights, see table VI) may thus be inter- 
preted. 

We restricted our treatment to conventional heads with a relatively short gap 
(*Karlqvist heads"). We want to stress, however, that a similar analysis 
including the effect of the presence of the head on the demagnetizing field can be 
extended without much difficulty to more-general head geometries. Therefore 
it should be admissible that the problem is treated as two-dimensional, that the 
head permeability may be taken infinite (without saturation effects) and that 
the head field configuration can be expressed by means of a complex potential 
function (e.g. long-gap conventional head +8), thin-film head +°) or magneto- 
resistive replay head). The kernel of the integral expression for the demagnet- 
izing field (corresponding, for a Karlqvist head, to the second factor in the 
integral of eq. (1)) follows immediately from Green's function for the con- 
figuration and can be explicitly expressed in terms of this complex potential 
and its derivatives. This function (or, in fact, the matrix \|4|| which was defined 
in eq. (5) of ref. 6) needs to be computed only once for a given configuration. 
Then, for this configuration, the recorded magnetization in free space can be 
computed without extra complications for all the cases it is desired to consider. 
The replay process requires some more attention if r + 0. Then the magnetiza- 
tion on the recorded layer changes with the relative position of the replay head 
and for each position considered a new series of iterations is required. These 
concern mainly reversible changes and, relatively, do not take much computing 
time. 


Appendix 
Analytical solutions to the free-space demagnetization problem for a square-loop 
material 


The situation after free-space demagnetization of a single transition recorded 
on a square-loop medium is frequently as sketched in fig. 36. Along the medium 
can be distinguished five regions (see figure). In the regions (D), (III) and (V) 
the magnetization is unaffected by the demagnetizing field Hp, contrary to the 
situation in the regions (IT) and (IV) where |a »| equals the coercivity H.. We 
shall show that, on the additional assumption that M (x) varies linearly in (III), 
and within the limitations of the thin-layer approximation used throughout most 
‘of this paper, an exact analytical solution can be found for H; p(x) and M(x) sub- 
ject to these ‘conditions. For this purpose we consider the external field of the 
layer, with components H,(x, y) and H,(x, y). The component Hy should be 
continuous across the layer whereas, from div (H + 4% M) = 0, it is easy to 
derive that, in the limit £ — 0, Mg t constant: 


d/M 
Hy (x, 0) — Hya(x, 0) = —4 (Mz t) ss. x) (14) 


INVESTIGATIONS OF DIGITAL MAGNETIC RECORDING ON THIN MEDIA 155 


M/Mg , H/Hc 


-1 


Fig. 36. Schematic indication of magnetization (drawn curve) and demagnetizing field (dashed 
curve) after free-space demagnetization of isolated transition on square-loop medium. 


in which H,, and H,, represent the values of 'H, at the layer surface, for 
y} 0 and y TO respectively. On symmetry grounds we have therefore 


H,,(x,0) = 0) = —27 (Mp t ial ues 15 
y(x, )=— y2(X5 ) = —27 ( R (>) ( ) 


Furthermore, outside the layer, we have div H = 0 and curl H = 0 from which 
it follows that, for y # 0, H, — iH, is an analytic function of z = x + iy. 

We put again p = 4zM&t/H., and introduce new variables £ = 2x/p and 
n = 2y/p with ¢ = £ + in. Furthermore we put m(£) = M(x)/Mp, hy(£) = 
H,(x)/H, and w(£) = u(&, n) + iv(£, n) = (H; —iH,)/H,. Then, according to 
the foregoing, w(¢) is analytic for 7 > 0 whereas 


- dm(£) 
w(5,7)— h(E) and vl, n) —> "aE for 50. (16) 
Now we assume that, in region (III) 
m(£) = £o (17) 


and denote the boundaries between the various regions by £= + f and 
é = + y (see fig. 37a). We should then have 


m(éy=0 (E< —), 

hf)=1 (~< < —P), e 

m'(E =a! (E < p, (18) 
hy£ 2—1 (B<&<y), 

m(éy=0 (£y) 
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b) 
Fig. 37. Relation between (a) ¢-plane and (b) w-plane. 


According to (16) these conditions imply that the function w(¢) maps the upper 
half ¢-plane on the interior of the rectangle in the w-plane indicated in fig. 37b. 
The mapping function which accomplishes this is, in the usual notation of 


Jacobian elliptic functions (see e.g. ref. 20) 
w) = — - sn! (y[C). 
K 

The modulus k is determined by the condition 

K]K' = a, 
whereas f and y appear to be related by 

B —ky. 
A further relation between f and y follows from the fact that 


[ 2 at = mig) — m = 1 Bis 
J dé 


B 


(19) 


(20) 


(21) 


(22) 
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If we work out these results in terms of standard elliptic integrals, we may 
summarize the final result as follows: 


a = K|K', P=2kK/x, y-2K]m, 
1 — £2/zy2)1/2 
ae ly?) js 


26 
m(é) =1+ —F (ares — 
my k' 


— arc 
MA 


t 


) 2 AL SA]? 
— Sin ————————— 
(B<é<y). (23) 
If a = 0, the analysis is simpler and we find 
2 . 
w(£) = — — arcsin (y/C) (24) 
p 
with f = 0 and y = 1. The magnetization distribution is then given by 
2 . 
m(&é) = — (£ arcosh lg + arcsin £) cél <1). (25) 
1 


Figure 38 shows m(£) and A5(£) for various values of «, together with (dashed) 
the usual arctangent approximation ++) 


2 27 È 
m(é) = = arctan (x &), h(é) = — Ipae (26) 


Fig. 38. Reduced magnetization m(€) and demagnetizing field A5(£) for various values of «. 
For comparison, arctangent approximation (dashed curve). 
" 
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for the problem's solution. The case « — 0, which we have if the original tran- 
sition prior to demagnetization is step-like, represents the shortest possible 
transition consistent with free-space demagnetization conditions. It ranges from 
x = —p/2 to x = p/2; hence p, as we defined it, represents the minimum tran- 
sition length. The transition shape and length do not depend much on « as 
long as « < 0:5, whereas for « > 1-5 an originally linear transition is prac- 
tically unaffected. 

The replay signal from these transitions is easily found because our analysis 
has supplied an expression for the external field in free space. With w(¢) given 
by (19) or (24) we have 


2 
A(x, y) = H. Re LE (x 4- i bb) } 


; Q7) 
H,(x, y) = —H, sign (y) Im Ee (x+i bb) | 

The field strength at the surface of an ideal zero-gap replay head located at 

y = —a is thus 


A(x) = 2H, im EE (x + id). (28) 
p 


This result can be expressed in terms of standard functions of real argument 
and was used for computation of the curves shown in figs 39 and 40, showing 
respectively the reduced pulse width PW;o/a and the reduced pulse height 
H,(0)/H,, both versus p/a. The parameter is « p/a which represents the length 
of the original linear transition, reduced by a. For comparison, both figures 
give a dashed curve which shows the corresponding result for the arctangent 
approximation (26). If we compare, in fig. 39, this curve with that for 
a pla = 0 then we see that the differences are relatively small (< 15%) as 
long as p/a < 30. For large p/a we find from (24) and (28), after some cal- 
culations, PW;so/a œ 2 (p/a)'/*. The difference with the arctangent-approx- 


arctan approx. 


bi 1 10 100 
— p/a 


Fig. 39. Reduced pulse width PW 50/a vs p/a. The parameter is « p/a. For comparison, result 
from arctangent approximation (dashed curve). 
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Fig. 40. Reduced pulse height H/H, vs pja. The parameter is « p/a. For comparison, result 
from arctangent approximation (dashed curve). 


imation result is more pronounced in the pulse height which for « = 0 appears 


to be given by 
H,(0 4 
HO = — arsinh £z ; (29) 
H, 7 2a 


If pja — 1 the pulse shape starts deviating significantly from that which 
follows for the arctangent transition. This is shown in fig. 41 where pulse 
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Fig. 41. Pulses for various values of p/a, normalized with respect to height and width (æ = 0). 
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shapes for various valus of p/a are depicted (for «x = 0), all normalized with 
respect to pulse height and PWs 9. For p/a «1 we [have approximately 
Hx) H0) ~ (1 + x?/a”)-1. In the region 1< p/a < 10 the pulse gets 
narrower at the foot, showing less-pronounced “tails”. For larger values of 
pla the foot broadens again and the peak becomes sharper. 

A similar analysis for two transitions or for three equally spaced transitions 
appeared to lead to rather intricate results, involving elliptic integrals even for 
œ = 0. The case of an infinite sequence of equally spaced transitions turns 
out to be relatively simple for « = 0; after some calculations it is found that 
the replay-signal amplitude is given by 


mc ute. ^ (blp < A) (30a) 
= — Ìn coth —, ; a 
H, z 2b -> á 
H, 4 ma 
= — arsinh (s cosech =) (b/p > A), (30b) 
H, 7 b 


in which 5 is the transition spacing and o is determined by the equation 


p 4 7^ arcsin (c sin p) dp 
-=< | ————— (31) 
b x à sin 9 


and in which A = z?/8C ~ 1:347 where C is Catalan’s constant. The deriva- 
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Fig. 42. Upper limit for density characteristic of square-loop medium; reduced amplitude 
H,/H, vs reduced bit separation b/a for various values of p/a. 
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tion of this result is left as an exercise to the reader. Equation (30a), in a more 
general form for non-zero layer thickness, had already been published in an 
earlier paper by Tjaden +°), in which a factor 2 was omitted by mistake. These 
results, which represent an upper limit for the density characteristic of a square- 


loop material, are depicted in fig. 42, which shows A,/H, vs bja for various 
values of p/a. 


Eindhoven, March 1975 
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MATHEMATICAL AND NUMERICAL ANALYSIS 
OF COAXIAL CABLE 


by G. C. GROENENDAAL and R. R. WILSON *) 


Abstract 


The primary and secondary parameters of coaxial cable have hitherto 
been calculated mainly for cables with thin outer conductors and for 
the higher frequencies, while the influence of the outer domain has been 
ignored. This article presents a comprehensive calculation of these 
parameters without the said restrictions. The cable model used is 
verified by measurements on a 1-2/4-4 mm coaxial cable. 


1. Introduction 


The electrical properties of a coaxial cable (see fig. 1) are completely defined 
by the primary and secondary parameters. The primary parameters, per unit 
length, are: resistance (R), inductance (L), conductance (G) and capacitance (C). 
The secondary parameters per unit length are: the characteristic impedance (Zo) 
and the propagation constant y. 


Fig. 1. Cross-section through coaxial cable. 


To obtain for these parameters formulae which can be used in practice, 
investigators in most cases confine their attention to coaxial cables in which 
the thickness of the outer conductor is either small in relation to the inside 
radius or large in relation to the skin depth. In all cases in which neither of 
these conditions is satisfied allowance has to be made for the influence of fields 
outside the coaxial cable (outer domain). 

Our aim is to give a comprehensive calculation of the primary and secondary 
parameters in which the influence of fields set up beyond the cable is taken 


*) NKF Kabel B.V., Delft, The Netherlands. 
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into account. The calculation will be verified by comparing it with the results 
of measurements on a 1:2/4-4-mm cable as recommended by CCITT. 


2. Calculation of the electric and magnetic fields 


We start fiom Maxwell's equations !) for a linear, homogeneous and isotropic 
medium, and we will assume that the fields and sources are harmonically time- 
dependent (exp (jwt)): 


VXH =J 4r jo D, V.D-o, D-—zcE, 


1 
V xE = —jo B, V.B-0, B-—uH. o) 


Let us suppose jw € = o + jw e for dielectrics. For conductors this equation 
reduces to jw € = o since the displacement current is negligible in relation to 
the conduction current +). Substitution in (1) yields: 


VXH- jotE, 


: Q) 
V XE = —jo uH. 
If we assume that the fields are divergence-free, it follows from (2) that 
V?^E =K? E, 
: (3) 
V? H = k? H, 
where 
k? = jw u o — œ? € u = —w? E y. 


In practice the dimensions of the cable are such that higher-order modes are not 
excited. We therefore confine our attention here to TMo, waves. If we now 
consider only the wave travelling in the positive z direction and assume sym- 
metry of rotation, the differential equations (3) can be solved by the separation 
of variables. For that purpose we take 


X(r, z) = f (r) exp (—y2); 


where y represents the propagation constant. 
For a cylindrical coordinate system the differential equation for E,(r) then 
becomes: 
o?E(r) 1oE(n) 


——— +h? Er) — 0, 
d?r? r or E 0) 


with 4? = y? — K?, 
The general solutions are then ?:3): - 


- 
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E,{r) = [A Jo(hr) + B Yo(hr)] exp (72), 


jwe 
H,(r) = EH [A Ji (hr) + B Y,(hr)] exp (—yz), 


Er) = : [A J (ir) + B Ys (hr)] exp C792). 


where J, and Y, are Bessel functions of the nth order and, respectively, of the 
first and second kind. 

The following formulae ?:*) apply to the various areas in a coaxial cable 
(exp (—yz) has been omitted for the sake of convenience) (see fig. 1): 


medium 1 limits formulae 

inner conductor Oxrsza E, (r)= 4 Jo(ir) 
Ayr) — p1 4i Ji(hır) (4a) 
hy? = —jo uo 2, 

medium 2 

dielectric a<r<b_ E,,(r)= A; Johar) + Bz Yo(h;r) 
Ho (r) = p2 [42 J (har) + B,Y,(h,)) (4b) 
h? = y? + o? & po 

medium 3 


outer conductor b<r<c_ E,,(r)=A3Jo(hsr) + Bs Yo(hsr) 
Hr) = Ps [45 Jilhsr) + Bs Yi(h,r) (4c) 
hj? = —jo uo 3 

medium 4 

outer domain roc E,(r) = Ag Ho (har) 
Heer) = pa 44 H1” (har) (4d) 
ha? = y? + 0? & Ho 

with p, = jw &,/h, and where H,” is the Hankel function of the nth order 

and the second kind. 

For the computation of the propagation and integration constants see ref. 3. 


3. Primary and secondary constants 


We now introduce the surface impedances, which are defined as follows ^): 


1 E,,(2) 1 E,,(b) 1 Eo 
 Qna Hy(a)’ ~  2ab Hp)? Ae Hec) 


(5) 


aa 


Since E, and H, are continuous at the boundaries, we have: 
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E,,(@) E.) 
Hy(aà  Hj(a)' 
E.(b E.) 
HQ — H0) 
E.(¢) EO 
Hosle) Hy) 
Substitution of (4) and (5) in (6) yields: 


Az Jo(h5a) + Bz Yo(h2a) 
A5 Ji(h5a) + Bz Yi(h2a) 


(6) 


T 2x ap», Zao 
(7) 
Az Job) + Bz Yo(h2b) _ 


M = -27 bp. Z, 
Az Ji(igb) +B, Yb) 


After elimination of A, and B, and application of the approximation formulae 
for Bessel functions with small arguments (which is permissible because in the 
dielectric har « 1) we find: 


h2 jo 2n E, jo 2m e2 ez LZ ) 

= x 
2 In ‘In (b/a) J a) aa bb. 

$ 1 + [jw x Ez Zaa Zop (b? — aP) (Zaa + Zos) 
1—[jw x &[1n (b/a)] a? Zaa 1n (At; b y'[2) + 5? Zos ln (; a y'/2D)] 

with y' — 0-577212 — Euler's constant. 

It was verified with the aid of a computer programme that for all types of 
coaxial cable occurring in practice and for all frequencies 


(8) 


is a very good approximation foi (8) (maximum inaccuracy 19/55). For the 
propagation constant we now find (see (4b)): 


= jo 2x 
2 ——w? F uz 9 
y s T T Zw). (9) 
Suppose *): 
.jo 2x &; 


c 
MGE, e 
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Introducing 
Ez = £o (& —j &") = £o &' (1 —j tan ô’), 


where ó' is the loss angle of the dielectric, we find 


2% Eq Er 
^ In (fa) ’ 
(b/a) (10) 
G = w C tan ô’. 
The equation *) 
y? = (R + jo L) (G + jo C) 
is of general applicability. Comparison with (9) yields: 
R + jo L = Zaa + Zo T jo Le (11) 
where 
b 
L= al In - 
2% a 


represents the external inductance. It follows from (11) that 


R= Re [Zaa + Zvl, 
L = Im [(Zaa + Zy)lo] + Le 
The secondary parameters Zo and y can be expressed in terms of the primary 
parameters R, L, G and C +). Let us take: 
Zo = Z, +j Zi, 
y=«+jß. 


Separation of the previously found formulae into real and imaginary parts then 
yields: 


[R tan ð + e L + (R? + o? L?/?]/ (Q), 


r 


~ Qo C) 
1 oLtanó —R 


ELLE cc i 
(2m C)!? [R tan ó' + o L + (R? + o? L2)! 2] (Q) 


Zi 


C 1/2 
Ps (=) [R tan 6’ —wL + (R? + w?L?)"}/2 (Np/m), 


(* =)" wLtand’+R 


a) ume e e cue c (iO) 
2) Ransol} orap Cm 


MATHEMATICAL AND NUMERICAL ANALYSIS OF COAXIAL CABLE 167 


Use has been made here of the fact that for modern cable-insulation materials 
tan? ó' « 1. 

The primary and secondary parameters in these formulae are expressed in 
terms of Zaa and Z;,. Calculation of the surface impedances will be considered 
in greater detail in the next section. 


4. Calculation of surface impedances Z;4 and Zpp 
4.1. Inner conductor 


Starting from the definition of Z,, (5) and introducing the modified Bessel 
functions we derive: 


= kı Io(k,a) 
ý 2% a0, I(k,a) 


(12) 


a 


4.2. Guter conductor 

It is assumed that the outer conductor is made up of one or more laminated 
conductors (not insulated from each other), while the whole is surrounded by 
the outer domain. We now wish to allow for the effect of this domain and any 
additional layers in the outer conductor in Z,, (5). We therefore start from 
layer n in the outer conductor (see fig. 2) which is bounded by r, and r,, ,. 
Using the relations: 


TAF kn tn) = exp (jn 7/2) Ik, T) 2 
Y, kn tn) = exp [j (n+ 1) /2] Ilkan tn) — — exp (jn 2/2) Kalku rz) | 
EA 


—3 < arg (kn rn) < 


n]a 


and formula (4), we now obtain: 
| ael à Tolkn Fa) Ko(K, ry) EM 
Nn y NEN Lt, Tn) —Ki(k, nj) B, i 


| E, Uum P X es n1) Kolken rn 1) E 
Nn Hy tas 1) Lk, us 1) —K(k, Fg 1) B, j 


Fig. 2. Cross-section through coaxial cable with laminated outer conductor. 
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Elimination of A,’ and B,' yields: 


RAO S, —P, Nn Eat D] 
Holta) g EA = Rn Hoslfnt 1) e 
with " As B 
Sx) = Ax [I (Ax) Ko(x) + Ki (Ax) Io(x)], 
P,(Ax) = Ax [Io(4x) Ko(x) — Ko(Ax) Io(x)], 
Q,{Ax) = Ax AX) Ki(x) — K (Ax) GJ 
R,(Ax) = Ax A) Ki) + Ko(Ax) 11], 
where 
Tro. kn 
A= >; X= karp; Nn = — 
Yn Ox 


(see also ref 5). 

On the basis of the continuity conditions for fields at the boundaries we find 
the following connection between the E, and H, fields at r, and r,,,, with the 
aid of (13): 


E,4("n) = S P E iin) 
Peal iu E | Frae aea 


| S ML | | | Sn+k-1 —Ln4k-1 fd 
—Q R —Qw.k- il In k-à Rat n-1 


k=1 


with 


Using this formula it is possible to express Z,, in terms of the surface impedance 
Zee Of the outer domain. It then follows from (5) that 


1 S—P/2xc Zec 


ea te iue 14 
"^ 2zxb Q— R[2nc Zee 09 
with 
» 1 Hy(h,c) 
"Ine p, H,Oh,c) 
and 


h4? = (R + jo L) (G + jo C) + o? & us. 
Further examination of (14) shows that Z,, cannot be calculated directly since 
Ze can only be found if A, is known. The formula for A, contains the primary 
parameters which depend, inter alia, on Z,,. The conclusion then is that Z,, 
can only be calculated using an iterative process. 


The process begins with Z,, — oo. Equation (14) then becomes: 
1 S (15) 
"xb l 


Zob 
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We thus obtain the familiar formula for Z,,, in which the fields in the outer 
domain are not taken into account (see also ref. 4). Then the primary and 
secondary parameters are calculated, followed by h, and Z,,. The iteration 
process is ended when the loss has been calculated with a prescribed accuracy. 
The process need not be carried out when 


P R 
——-«8S and — < Q. 
2z c Z« \ 2z € Zec 


In practice these conditions are amply satisfied for frequencies higher than 1 MHz. 

A computer programme has been written with the aid of which it is possible 
to work out the primary and secondary parameters. The flow diagram of the 
programme is shown in fig. 3. 


Initialize : 
d,D, Gcusdgeil, 
01, 05,0, Eps Mp, 
amax,tan Ô, tol, f [n] 


Zec — co 
n:=O0;counter:=0} 


Compute:G ,Zaq 


n:zntl 
counter: zO 


Fig.3 


Fig. 3. Flow diagram of the computer programme. 
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5. Practical example 


The computer programme was applied to a 1-2/4-4-mm coaxial cable as 
recommended by the CCITT. The ambient temperature of the cable was set to 
10°C. Computations were effected for the following cases: 


number outer conductor 


copper foil 
copper foil -+ steel tape 
copper foil + steel tape 


surface impedance Zec 
infinite 

infinite 

finite 


The input data for cable model no. 3 were (the thicknesses of copper foil 
and steel tape are denoted by dc, and degree, respectively): 


2a = 1193 mm o, = 60 (. 10° S/m) E = 1:14 
2b 4-39 o, = 60 (. 10 S/m) u = 250 
deu 0:18 o,= 8(. 10° S/m) tan 6’ = 1-6 (. 1074) 
Aster = 0°15 
I 1000 m 


Table I (see pp. 173/174) shows the results of the computer calculation for model 
no. 3. This cable model was verified by comparing the calculated and measured 
insertion losses 9) of a 1-:2/4-4-mm coaxial cable with a length of 495 m and 
a source and load impedance of 75 ohms with each other. Both values are 
shown in table II as a function of the frequency. The relative deviation in 
relation to Gneor is also listed in this table for cases nos 1 and 2. 

The insertion loss was measured with Walter and Goltermann's Level 
Measuring Set PSM7 with an accuracy of + 1 mB and a minimum frequency 
of 10 kHz. The measurements were performed in NKF's Telecommunication 
Development Laboratory at Delft. 

Figure 4 shows the relative deviation of the measured from the calculated 
insertion loss. 


Ci rend Ld eat — UT E41 
2 5 


0 2 5 0% 2 5 0 2 5 10 
— —»- f (kHz) 


105 


Fig. 4. Relative deviation in the measured loss versus frequency. 
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Figure 5 is a graph of the values in table II for the 0-1—1000 kHz frequency 
range. This shows that the steel tape round the copper foil produces a lowering 
of the loss at the low frequencies. When the skin effect becomes sufficiently great 
(6 « a) the difference disappears. It can also be seen that allowing for the effect 
of fields extending to the outer domain results in an increase in the loss (model 
no. 3 compared with no. 2). The relative differences in R and L for the various 
models are shown in figs 6 and 7, respectively. 


ptt LL IL [nb eee] | 
C M D NEMSE See 
— QW — 
COHN ee 
a 
Cmeasured -XNo3 
Nea, 
Ep De E FT TI 


3 


Fig. 5. Relative deviation in the calculated and measured losses versus frequency. 


HOT I esm 
pil __.— Phor = Phos 
N Rio 


Proz — Fas = unl 


Fig. 6. Relative deviation in the calculated resistance versus frequency. 


B 
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BEL ees 
ERAT ENSE UT LG 
Oe er A 


/ LET. 3e LC 
= 


PAL rth TL 
Fini ae T 
NT cele bL - EIL SLE 
-6 

io 2 5 10^ 2 5. 


(d 2 5 10? 2 5 


-4 


-5 


103 
—> f (kHz) 


Fig. 7. Relative deviation in the calculated inductance versus frequency. 


6. Conclusion 


This article has indicated how the primary and secondary parameters of a 
coaxial cable can be derived, allowance being made for the effect of fields set up 
outside the cable. There are no restrictions with regard to the wall thickness 
of the outer conductor or the frequency range to be chosen, provided that the 
higher modes are not excited. 

The cable model was verified by comparison with the computed and measured 
results for a 1-2/4-4-mm cable as recommended by CCITT. There is good 
agreement between the results (maximum difference 0-9 77). 
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f (kHz) 


R (OI) 


7-20-7825 
+20-7832 
-+20-7854 
4-20-7874 
41-20-7892 
-F20-7910 
+20-7927 
+-20-7961 
+20-7998 
-+20-8037 
+20-8224 
+20-8531 
-F20:8911 
71-20-9357 
--20-9858 
-21:0999 
+21-2282 
+21-3679 
+22-0242 
+23-0575 
+24-2898 
+25-6456 
+27-0512 


L (mH/I) 


4-:3480 
T3441 
+-3360 
+-3318 
+-3296 
+:3283 
+3274 
+:3262 
+:3255 
4-:3250 
4-:3238 
1-:3230 
4-324 
4-:3219 
4-:3214 
--:3205 
+:3197 
+3189 
+-3165 
4-:3140 
4-:3117 
--:3095 
4-:3072 


G (u5/I) 
:0049 
0059 
‘0098 
0147 
0195 
‘0244 
0293 
0391 
0489 
0586 
0977 
1466 
"1955 
2443 
2932 
3909 
4887 
-5864 
‘9774 
1-4661 

1:9547 
2:4434 
2:9321 


TABLE I 


1 = 1000-00 m 

CFD Z (Q) 
48-6106 586-40 
48-6106 ^ 53585 
48-6106 — 41671 
48-6106 341-93 
48-6106 297-58 
48-6106 —— 26747 
48-6106 — 24537 
48-6106 — 214-59 
48-6106 193-82 
48-6106 178-67 
48-6106 — 143-87 
48-6106 123-20 
48-6106 — 11175 
48-6106 104-52 
48-6106 99-59 
48-6106 93-43 
48-6106 89-85 
48-6106 87-57 
48-6106 83-52 
48-6106 81-83 
48-6106 81-02 
48-6106 80-47 
48-6106 80-03 


Z, (Q) 


—580-17 
—529-11 
—408-27 
—331-74 
—285-91 
—254-49 
—231-20 
—198-30 
—175-67 
—158-84 
—118-46 
—92-36 
—16-50 
—65-58 
—57-49 
—46-21 
—38-67 
—33-28 
—21-58 
—15:37 
—12-26 
—10-43 
—9-22 


« (dB/I) 


:15394 
16847 
21666 
26407 
-30345 
33763 
-36808 
-42094 
46613 
-50575 
-62865 
13522 
:81203 
:87007 
91532 
‘98097 
102632 
1-05994 
1:14555 
1-22419 
1-30275 
1-38498 
1:46895 


B (rad/I) 
:0179 
:0196 
:0255 
-0313 
0364 
0408 
0450 
0524 
0592 
0655 
-0879 
-1129 
-1365 
-1596 
-1825 
2283 
2744 
-3209 
-5102 
-7498 
-9898 

1-2289 
1:4666 


T (us/l) 
28-501 


26044 . 


20-254 
16-619 
14-463 
13-000 
11-926 
10-430 
9-421 
8-684 
6:993 
5:988 
5:432 
5-080 
4:841 
4-541 
4:367 
4:257 
4-060 
3-978 
3-938 
3-912 
3-890 


AT&VO 'IVIXVOO JO SISATVNV TVOMAWON ANV 'IVOLLVASH.LVIA 


£LI 


f (kHz) 


80-00 
100-00 
120-00 
200-00 
300-00 
400-00 
500-00 
600-00 
800-00 

1000-00 
1200-00 
2000-00 
3000-00 
4000-00 
5000-00 
6000-00 
8000-00 
10000-00 
12000-00 
20000-00 
30000-00 
40000-00 
50000-00 
60000-00 


R (QJI) 


+29-8130 
+32-3637 
+34-6798 
+42-4359 
4-50-6032 
+57-8960 
+64-5180 
4-70:5738 
+81-3091 
+90-6535 
+99-0089 
--126:6744 
+154-2939 
+177-6080 
+198-1484 
+216-7184 
+249-6923 
+278-7448 
+305-0112 
+392:7400 
+480-2163 
+553-9637 
+618-9373 
+677-6783 


TABLE I (continued) 


L (mH/) G (S/D) 


+:3031 
+:2997 
+:2969 
+-2898 
4-:2852 
4-:2822 
+-2801 
14-2784 
+-2761 
+-2744 
+-2732 
4-:2704 
+2686 
4-:2615 
4-:2668 
+-2662 
4-:2655 
4-:2650 
4-:2646 
4-:2637 
+:2631 
-1--2628 
4-:2625 
+-2624 


3-9095 
4:8869 
5:8642 
9-7737 
14-6606 
19-5475 
24-4343 
29-3212 
39-0950 
48-8687 
58-6424 
97-7374 
146-6061 
195-4748 
244-3435 
293-2122 
390-9496 
488-6870 
586-4243 
977-3739 
1466-0609 
1954-7478 
2443-4384 
2932-1217 


C (F/D 


48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 
48-6106 


Z (9) 


79-34 
78-80 
78-38 
TT-34 
76-68 
76-26 
75-96 
75-13 
75-39 
75:16 
74-99 
74-59 
74:34 
74-19 
74-08 
74-01 
73-90 
73-83 
73-78 
73-65 
73:57 
73.52 
73:49 
73-47 


Z, (Q) 


—7:68 
—6-72 
—6-03 
—4-48 
—3-60 
—3:10 
—2-TI 
—2:54 
—2:20 
—1:97 
—1-80 
—]1:38 
—]:13 
297 
— 87 
—-79 
—-69 
—61 
— 56 
—:43 
—:35 
—-30 
Ni 
—25 


a (dB/I) 


1-63328 
1-78530 
1-92358 
2:38609 
2:87108 
3-30376 
3-69700 
4-05708 
4-69651 
5:25401 
5.75299 
7-40722 
9-06146 
10-46019 
11-69433 
12-81156 
14-79884 
16-55346 
18:14278 
23-47151 
28-81630 
33-34663 
37-35646 
40-99654 


B (rad/I) 
1:9386 
2-4068 
2:8727 
4:7246 
71-0257 
9.3164 

11-5996 
13-8774 
18-4219 
22-9567 
27-4853 
45-5636 
68:1147 
90-6359 
113-1382 
135-6270 
180-5761 
225-4984 
270-4016 
449-8961 
674-1126 
898-2353 
1122-2981 
1346-3184 


T (us/l) 
3-857 
3-831 
3-810 
3-760 
3-727 
3-707 
3-692 
3-681 
3-665 
3-654 
3-645 
3-626 
3-614 
3-606 
3-601 
3-598 
3-592 
3-589 
3-586 
3-580 
3-576 
3:574 
3:572 
3-571 


vLT 
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TABLE II 
1—495m 


measured 


(dB) 


+-57616 
4-:57618 
4-:57624 
4-:57629 
4-:57634 
4-:57639 
4-:57644 
4-:57653 
4-:57663 
4-:57674 
4-571725 
4-:57808 
4-:57911 
4-:58032 
4-:58168 
4-:58476 
4-:58822 
4-:59199 
4-:60959 
4-:63708 
4-:66955 
4-770495 
4-774141 
4-:81283 
+:87952 
4-:94176 
+1-16976 
+1-42173 
+1-63089 
+1-82705 
+2-00745 
4-2:32360 
+2-59855 
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1200-00 
2000-00 
3000-00 
4000-00 
5000-00 
6000-00 
8000-00 
10000-00 
12000-00 
20000-00 
30000-00 
40000-00 
50000-00 


. 60000-00 


G. C. GROENENDAAL and R. R. WILSON 


TABLE II (continued) 


4-284618 
4-366585 
+4-48528 
4-517748 
4-578886 
1.634177 
--7-32573 
--8-19442 
--8:98122 

+11-61903 

--14-26480 

--16-50742 

+18-49231 

--20-29418 


| No 1 | No 2 


Omeasured 


. (dB) 


measured 


measured 
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THE COPRECIPITATION OF Zn, Cd AND Hg 
WITH FERRIC HYDROXIDE 


by E. BRUNINX 
Abstract 


A number of variables has been studied in the coprecipitation of trace 
amounts of Zn, Cd and Hg with ferric hydroxyide. Some of these phenom- 
ena can be explained by an ion-exchange reaction between the metal ion 
and the ferric hydroxide. The commonly proposed Langmuir or Freund- 
lich isotherms do not appear to describe all observed coprecipitation 
phenomena correctly. 


I. Introduction 


Coprecipitation (or collection) is the simultaneous precipitation from the 
same solution of a normally soluble component with a macro-component and 
may be effected by formation of mixed crystals, by adsorption, occlusion or 
mechanical entrapment. Coprecipitation which extends over longer periods of 
time is usually called postprecipitation and is not considered here *). Analo- 
gous terms are: gathering, sorption, scavenging and carrying down. 

Coprecipitation was, and still is, considered as a nuisance in classical gravi- 
metric analysis because it impairs the weighings of a precipitate and thus the 
concentration of the desired element °°). In the field of trace-element analysis 
coprecipitation is considered as a blessing since it allows the collection of micro 
amounts of elements on some precipitated compound ?). Many precipitates 
are suitable: the sulphides of Cu?*, Pb?*, Hg?* and Cd?*, zinc oxide, the 
hydroxides of Mn?*, Fe?* and Al?*, as well as some organic precipitates, 
e.g. copper-8-hydroxyquinolate. 

Coprecipitation also plays an important regulating role in natural aquatic 
chemistry. It is realized more and more that compounds like the hydrous oxides 
of Si, Al, Fe and Mn — as well as various clays — are able to store and retain 
large amounts of toxic elements such as Pb, Cd, etc.*:5). Coprecipitation is 
used in this way intentionally in another branch of aquatic chemistry, i.e. the 
cleaning up and decontamination of used, polluted effluents (sewage) 5). 

Much of the earlier work on coprecipitation was done as a sideline to gravi- 
metric analyses in order to be able to check on and to understand these undesired 
phenomena. At the same time the chemical study of some naturally occurring 
radioactive elements also involved coprecipitation phenomena. It is from these 
two fields that the first interpretations evolved 7), together with the formulation 
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of general, Bp] rules, and generally valid for well defined crystalline precip- 
itates. 

A better description of surface pheiomeni ensued explanations in terms of 
Freundlich and Langmuir adsorption isotherms. It also became increasingly 
clearer that a number of coprecipitants (i.e. precipitates that are able to collect 
traces) possess ion-exchange characteristics and that therefore chemical- -equi- 
librium theory could be applied. 

The aim of this work is to obtain data on coprecipitation in a well defined 
medium: (a) in order to employ this technique for the collection of trace 
elements 8), (b) to obtain a qualitative understanding of the coprecipitation 
mechanism. Iron hydroxide was chosen since it has been demonstrated that it 
could be used successfully for the collection of many trace elements. Un- 
fortunately, most of the available data in the literature do not allow an inter- 
comparison, as the experiments were done under widely varying (and some- 
times ill-defined) conditions. 

Following a short summary of the various coprecipitation processes, we 
describe the experiments in which the distribution coefficient (coprecipitation 
yield) was measured as a function of pH, trace-metal-ion concentration, iron 
concentration and volume. The influence of a number of other parameters such 
as complexing agents and time was also examined. 


2. Classification of coprecipitation phenomena 


Kolthoff??) has proposed the following subdivision of coprecipitation 

phenomena: 

(1) adsorption of foreign ions at the surface of a precipitate; 

(2) occlusion, i.e. the incorporation of foreign ions in the interior of the pre- 
cipitate. 

We shall only give a very brief description of the various characteristics; for 
more details the reader is referred to the work by Kolthoff ?*), Laitinen °) 
and Walton 1°). 

In this classification of coprecipitation phenomena we do not take into ac- 
count the concentration range and the reaction kinetics. Many of the coprecip- 
itation phenomena to be outlined were observed in relatively concentrated 
solutions. They were usually confined to the sorption of one element into or 
onto a precipitate. In many of these experiments the precipitates were formed 
and studied under equilibrium or near-equilibrium conditions. 

As already pointed out above our aim is to gain insight into the coprecipita- 
tion technique for purely analytical purposes, i.e. for the collection of more 
than one trace element on an iron-hydroxide precipitate. It is thus important 
that the time be kept as short as possible, so that real equilibria are quite pos- 
sibly not attained. Some of the mentioned coprecipitation processes may there- 
fore very well be of lesser importance. 
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2.1. Adsorption at the surface 


2.1.1. Adsorption of potential-determining ions 
The precipitate attracts (adsorbs) a common ion from the solution: 


BaSO, +  Ba?* $; BaSO,iBa.. (1) 
(surface) ^ (solution) 


Both anions and cations can be bound to the precipitate, mainly by electric 
attractive forces. Many phenomena of this type were described by the 
Paneth-Fajans-Hahn rules. 


2.1.2. Adsorption by ion exchange 
In this case a surface lattice ion is exchanged with a foreign ion in solution: 


BaSO, -+ Pb?+ t5, PbSO, + Ba?^*. (2) 
(surface) (soln) (surf) (soln) 


This can be described by the equation 


T [Ba?* i, [PbSOg]surr (3) 
[BaSOg]suce [Pb?* is 
or 
; [Pb?*] 
=K pe. (4) 
[Ba**] 
where X is the amount adsorbed per unit quantity of adsorbent (mol/mol). 
2.1.3. Exchange between adsorbed counterions 
An example of this type is 
Agl .I-iK* + H* 5; Agl. I | H* + K*, (5) 


where I- represents the iodide anion adsorbed at the AgI; K* is the adsorbed 
counterion. ; 

This reaction can be described as a chemical equilibrium with an equation 
similar to eq. (4). In this case too the Paneth-Fajans-Hahn rules have given 
a number of qualitative descriptions. 


2.1.4. Molecular or ion-pair adsorption 


Cation-anion pairs can occupy simultaneously adjacent sites on the precip- 
itate surface, i.e. equivalent amounts of cation and anions are adsorbed. This 
adsorption process can be described fairly well by the Freundlich isotherm, 
an empirical equation: 


X=KC, (6) 
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where X — amount adsorbed per unit quantity of adsorbent (mol/mol), 
C = equilibrium concentration (mol 17*), 
K — constant. 


2.1.5. Monolayer adsorption 


If the precipitate has only a limited number of surface sites then the Lang- 
muir isotherm gives a better description: 
Xm KC 
X = ———__, (7) 
1+KC 
where X = amount adsorbed (mol/mol), 
Xm = limiting value for X (i.e. monolayer capacity), 
C = equilibrium concentration (mol 17+), 
K =constant. 


2.2. Occlusion in the precipitate 


2.2.1. Mixed-crystal formation 


In this case an anion or a cation of the crystal lattice is replaced by a different 
anion or cation of the same charge and about the same size. The Doerner—Hos- 
kins formula describes these phenomena fairly well. 


2.2.2. Ion entrapment 


The amount of entrapment is to a large extent determined by the conditions 
under which the precipitate is formed: in most cases no equilibrium is reached. 


3. Method of precipitation 


Most experiments were done with KNO, (107? N) as supporting medium. 
For the measurements at constant volume 40 ml of this solution was taken 
and 0-2 ml HCl (4-4 N) was added, yielding a pH between 1-5 and 2-0. This 
low pH value is necessary in order to maintain Fe?* in solution. To this solu- 
tion were then added the desired amounts of Fe?* (as FeCl, in 0-1 N HCl), 
metal ion (ZnCl;, CdCl, or HgCl, in 0-1 N HCl) and the corresponding radio- 
active nuclides ($5Zn, 115Cd, ?93Hg). At this stage the total volume lies between 
41 and 42 ml. Precipitation was carried out with 1 N NH4OH and all pH 
measurements were done with a glass electrode. The precipitate was allowed 
to settle for 15 min and filtered over a Sartorius membrane filter No. 13303, 
pore size 1-2 um. The apparatus is comparable to the one described by Luke 11). 

Beaker and filter chimney were rinsed with 5 ml KNO, 107? N. The whole 
precipitate and 5 ml of the aqueous phase were counted with an Nal (TI) de- 
tector and from the data obtained (after correction for dead time, volume and 
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background) the distribution coefficient D was calculated: 
— Meppt 
Mesoin 


where Me,,, = amount of Zn, Cd or Hg in the precipitate, 

Me,,1, = amount of Zn, Cd or Hg in the whole solution. 
All data are presented graphically as log,o D. Where necessary the standard 
deviation (counting statistics) of D has been indicated in the graphs. For all 
other D values without error bars the standard deviation due to counting 
statistics is less than 0-005 log units. 

Some experimenters have standardized the iron hydroxide by washing away 
all cations and anions as far as possible. The trace elements to be adsorbed were 
then introduced into the solution and adsorbed during a given time. We did 
not adopt this technique as it is rather time-consuming and not readily appli- 
cable to analytical separations. 

Sometimes — after washing — the iron hydroxide is dried at varying tem- 
peratures. This granulation treatment is advisable when iron-hydroxide columns 
are employed since they allow a faster percolation rate. The drying process 
however inevitably reduced the adsorptive capacity !?). 

We have chosen NH4OH for the precipitation as it is obtainable with a 
higher degree of purity than e.g. NaOH: this is important for analytical ap- 
plications +°). Other reagents were not considered. 

Besides KNO; (107? N) two other media were tested: synthetic river water 
(SRW) and synthetic sea water (SSW). The composition of the river water was 
as follows (in mg17+): CaCl, . 2H;O: 294 — NaCl: 216 — MgSO, . 7H;O: 
86 — KCI: 9:5 — (NH4);HPO,: 7-3. The data for sea water 1^) are (in g 173): 
NaCl: 27-2 — MgCl; .6H5,0: 8-1 — KBr: 0:103 — MgSO, .7H,0: 3:4 — 
CaSO, .2H,0: 1-59 — K,SO,: 0-86 — CaCO;: 0:12. 


D ; (8) 


4. pH dependence 


It has been known for some time that ion-exchange reactions play an im- 
portant role in coprecipitation reactions with hydrous oxides 15—17). Accord- 
ing to this idea iron hydroxide can act as an ion exchanger as follows: 


n FeH + Me?* = Fe,Me + n H+ (9) 
or after rearranging and applying mass-action law 
logio D = n pH + log;o K + n logio [FeH] (10) 


where K is the equilibrium constant and nlog,, [FeH] a term depending upon 
the mass of the sorbent. The formula of iron hydroxide, written as FeH, does 
not imply any structure of this type. The results for Zn, Cd and Hg in KNO,, 
107? N, are shown in fig. 1, and for Zn in synthetic river water in fig. 2. 
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$ /Zn, 2.10*M 


SS. Cd, 355.105M 
N. 


4 


Cw Cd, 89.105M 
ES 


mg Hg, 89.10 5M 


Fig. 1. Logio D for Zn, Cd -20 
and Hg in KNO, (107? N) 

as a function of pH. 

Fe = 1:73 . 107^ mol 171. 

The numbers following the 

chemical symbols indicate 

the concentration before pre- 4-0 6:0 80 10-0 
cipitation. — pH 


Fig. 2. Logig D for Zn in 
synthetic river water as a 
function of pH. 

Fe = 1:73. 107^ mol 171. 
The numbers following the 
chemical symbols are the Zn 
concentrations before precip- 


40 6:0 8:0. 10-0 
itation. : —— pH 
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In general the pH-dependence curve shows three distinct parts. Up to 
pH = 5-6 the distribution coefficient is rather small and increases but slowly. 
In the second part the distribution coefficient increases rapidly for all three 
elements up to pH — 8-9. At the higher pH values the distribution coefficient 
drops again. By means of the least-squares method eq. (10) was fitted to the 
data in the second part of the curve, i.e. between pH 6 and 9. The values of 
the slope n and their standard deviations are given in table I. 

When the initial Zn concentration in SRW is varied by a factor of 100 the 
value of n remains nearly constant; it is somewhat higher in KNO, medium. 

Similar results were obtained by Upor et al.1®) who observed a linear rela- 
tion between logio D and pH and a value of n = 1. These results differ in 
two aspects from those obtained by Gadde and Laitinen !?): they found a 
nonlinear relation between log,o D and pH; in addition their values of D are 
lower. 


TABLE I 


Value of n at varying pH and constant metal concentrations 


element; concentration 
(mol/l) medium 
before precipitation 


; KNO,, 10-2 N 15 +01 

Zn; 75.1077 SRW *) 12 +01 

Zn; 75.1076 SRW 11 +01 

Zn; 75.1075 SRW LI +01 
Cd; 3:5.1075 KNO;, 107? N 1-04 + 0:05 
Cd; 8.9.1075 KNO, 0-60 + 0-04 
Hg; 2:7.10-° KNO; 0-69 + 0-03 
KNO, 0-64 + 0-02 


*) Synthetic river water. 


The low value of n at the higher Cd concentration cannot be explained: in 
view of the similarity of Zn and Cd one would not expect such a decrease. 
Here also Gadde and Laitinen !?) observed a nonlinear relation between 
logio D and pH. 

The distribution coefficient of Hg, both at high and at low mercury concen- 
trations, shows the same kind of pH dependence, although the highest distribu- 
' tion coefficient is observed at a somewhat higher pH value. The value of n 
is definitely lower than for Zn and Cd. Shimomura et al.??) measured a similar 
pH dependence. Their value of n (= 1) is probably due to the very high CIT 


184 E. BRUNINX 


concentrations resulting in a rather different distribution of Hg species as com- 
pared to our medium. Lockwood and Chen ?!) observed no pH dependence 
between pH 4 and 10. It is not clear whether this effect is due to the different 
conditions of precipitation. 

. All three experiments indicate clearly that the value of n is not 2 as might 
be expected from the valency of the three elements. In other words: it is un- 
likely that any of the three metals is bound to ferric hydroxide as the bare 
cation Me?*. Since we know the ionic composition of the solution we can 
easily calculate the percentage composition of the different species present at 
any moment. We have made the calculation for a pH = 8-5, taking into ac- 
count the chloride and the ammonia concentrations. The equilibrium constants 
were taken from Ringbom ??) and the percentage of the different chemical 
species (metal bound to a ligand (MeL,) and free metal (Me)) was calculated 
according to the formulae 2°) 


J 
% MeL, 100 8, [L] 


BESTAN 
100 


AN ATTER uu 


(11) 


and 


where [L] — ligand concentration 
B, = overall equilibrium constant for the reaction Me--nL 5 Me L,. 
The results are summarized in table II. 


TABLE II 
Distribution of various species at pH = 8-5, pCl- = 1-57; final base concen- 
tration: 3.1072 N 


% composition 
species 


MeOH* 
Me(OH); 
MeCI* 
MeCl, 
MeCl, 
Me(NH;)** 
Me(NH3);* * 
Me(NH3)4* * 
Me(NH3),* * 
Me** 
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It follows that with NH4OH the amounts of free-metal ion, chloride com- 
plexes and hydroxyl complexes are negligible in all three cases and that virtually 
all the metal not coprecipitated with iron is complexed with amine. In a separate 
experiment using NaOH instead of NH,OH we measured the distribution ratio 
for the three elements. In all three cases the distribution coefficient D remained 
practically unchanged even when the concentration of the species in equilibrium 
with the iron hydroxide changed drastically. We may therefore conclude that 
the coprecipitation of Zn, Cd and Hg proceeds only through the intermediary 
of either the hydroxide or chloride complexes. Keeping in mind the values of n 
(see table T) one would expect that a singly charged species such as MeOH* 
or a mixture of doubly charged and neutral species would be bound to the iron 
hydroxide. The data in table II show also that in the NH,OH precipitations 
the solubility product is nowhere reached and that this effect can be entirely 
neglected. 


5. Dependence upon the amount of iron precipitated 


On the basis of eq. (10) we can also expect that at constant pH, logio D 
will increase linearly with logio [FeH]. Figure 3 shows the results of the 
measurements at constant pH — 8:5, and constant trace-metal concentration. 
In table III the values of n and their standard deviations are summarized. 


-50 40 
— logy [Fe] (mol/1) 


Fig. 3. Logig D for Zn, Cd and Hg in KNO, (107? N) at pH 8-5 for varying initial iron 
concentration. The initial metal concentrations are (moll^2): Zn = 3:2 .1076; Cd = 
3:5.1075; Hg = 2:8.10- 6. 
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TABLE III 


Value of n for varying Fe concentration and constant hydrogen and metal-ion 
concentrations 


element; concentration 
(mol/l) medium n 
in original solution 


Zn; 32.10-5 KNO;, 1072N | 12 +02 
Zn; 79.1076 SRW 09 +01 
Cd; 3:5.10-5 KNO,, 107? N | 1-60 + 0-08 
Hg; 2:8. 10-6 KNO,, 10-2N | 0-80 + 0-02 


There is some agreement between the values of n obtained from these ex- 
` periments and those in table I, except for Cd. No other experiments have been 
reported in the literature except those of Shimomura et al.?°) who studied the 
coprecipitation of Hg?* with iron hydroxide. They observed a value of 1:2 for 
n for a medium quite different from ours (much higher chloride concentration). 
They observed the same value of n at other pH values. 


15 25 
— logg” (ml) 


eects 


Fig. 4. Logio D for Zn, Cd and Hg in KNOs (107? N) at pH 8:5 and varying initial volume. 
The amount of metal added is Zn = 0-15 pmol; Cd = 0-14 pmol; Hg = 0:11 pmol; 
[Fe] = 7:16 pmol. 
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6. Volume dependence 


Keeping the iron/metal ratio constant and changing the volume in which the 
coprecipitation takes place, one might expect to obtain a constant distribution 
coefficient. 

Unfortunately this is only partly true (see fig. 4). Above a certain volume 
(V > 80 ml or logio V < 1-9) D decreases with increasing volume. This effect 
seems not to be connected with the individual coprecipitation reactions, since 
the break in the curve appears at the same point for all three elements. It is 
probably due to the characteristics of the iron hydroxide, but no clear explana- 
tion is available. The few data in the literature confirm this phenomenon ??). 


7. Influence of the amount of metal added 


In these experiments the amounts of Zn, Cd or Hg were changed, all other 
parameters being constant. The results are shown in fig. 5. With all three metals 
a decrease of log, D is observed for increasing amounts of metal. Qualitatively 
this can be explained by the fact that the iron hydroxide possesses a limited 
number of free sites for binding the metal ions. Increasing the total metal-ion 
concentration thus results in a larger fraction of metal remaining in solution. 

Many attempts have been made to give this behaviour a more quantitative 
description by means of different adsorption isotherms, usually the Freundlich 
or the Langmuir isotherm, and in a number of cases these isotherms fit the data 
reasonably well. 


-60 -50 -40 
— logo [Me] (mol/t) 


Fig. 5. Logio D for Zn, Cd and Hg in KNO; (107? N) at PR = 8:5.and varying metal 
concentration (before precipitation). Fe = 1-73 . 1074 mol 17* 
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Two remarks must be made in this respect. 

(a) The original Langmuir equation was proposed for gas adsorption on solid 
surfaces with the assumption that the surface would be covered only by a 
monolayer of gas molecules. In coprecipitation experiments it has never 
been shown that — mutatis mutandis — only a monolayer of an element is 
adsorbed onto the surface. 

(b) Many — but not all — published data apply the Langmuir equation over a 
narrow range of equilibrium concentrations. As there is usually some spread 
of the data points, possible deviations are very unlikely to show up. 

In table IV are summarized the measured values for the various parameters 
of the two types of isotherms. A (—) sign after the chemical symbol means that 
the fitting was statistically not significant 75), or, in other words, that the pro- 
posed isotherm — either Langmuir or Freundlich — does not describe the data 
correctly. In general the Langmuir isotherm does not appear to be a good 
choice. From the scarce literature data for similar systems we can conclude 
that the agreement is fair, taking into account that the solutions differ in their 
original constitution. 


TABLE IV 


Measured coefficients for Langmuir and Freundlich isotherms; other isotherms 
as summarized in ref. 24 were not useful 


mol Me ( 1 ) 
element | X,, K| —— 
( mol Fe ) mol 


Langmuir Zn (—) 0-39 6:8 . 10-6 
Cd (2 0-06 24 . 108 27 
Hg (— 0-03 5.10* 


Freundlich Zn (4-) 
Cd (—) 


Hg (+) 


. We have also tried an entirely empirical approach by fitting 


log,o D = a + blog [Cd] + c log [Fe] + dlog V. 


One could include in this model additional cross-terms such as [Cd]. [Fe] 
as was done by Malofeeva et al.?9). This would lead to a large, impractical 
number of parameters and automatically exclude a chemical description. 
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Furthermore, the addition of some other parameters, such as temperature, 
would require an entirely new determination of the parameters. 


8. Influence of various chemicals 


The distribution coefficients of Zn, Cd and Hg were measured in synthetic 
river water (SRW) and synthetic sea water (SSW); in KNO; (107? N) in the 
presence of KCN, EDTA and citric acid; see table V. All measurements were 
done at pH 8:5; metal concentration (in moll): Zn = 36.1075; Cd = 
8.5.1077; Hg = L:5.1075; Fe = 1:73 . 107^. 


TABLE V 


Distribution coefficients in different media 


KNO, 107? N 
SRW 
SSW 


KCN 24.1075 
KCN 24.1075 
KCN 2:4. 10-4 


EDTA 24.1076 
EDTA 24.1075 
EDTA 2-4. 1074 


0-002 


0-02 ` 
0-001 


0:24 


H,cit 2:4. 1075 
Hicit 2:4. 1075 
H,cit 2-4. 1074 


*) The numbers indicate the final molarities before precipitation. 


Little difference is observed between the usual KNO, medium and the syn- 
thetic river water: both of them have roughly the same ionic strength and the 
concentration of possible complexing ions such as C17 or PO4?"7 is sufficiently 
small. Sea water has a much higher C17 concentration. The increasingly strong 
complexation of Zn, Cd and Hg (in that order) by Cl- results in a decrease 
of D: Zn remains nearly unaffected while the decrease is largest for Hg. 

For KCN, EDTA and H,cit two effects must be considered: 

(a) The complexation of iron. It is rather weak with KCN and much stronger 
with EDTA and H,cit. In other words: iron hydroxide still precipitates at 
the highest cyanide concentration, but not in 2-4. 107^ N EDTA or 
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Hiscit: the observed D values (at these concentrations) represent merely 
some adsorption on the filter material and do not have any meaning. 

(b) The complexation of the Zn, Cd or Hg. Mercury forms the strongest cyanide 
complexes and this results in the destruction of Hg-iron-hydroxide com- 
plex, even at the lowest cyanide concentration. The effects are much less 
pronounced for Zn and Cd, where only the highest cyanide concentration 
“has any effect. 


9. Time dependence 


For all three elements the distribution coefficient was measured as a function 
of time, up to 100 hours. Once the pH was brought up to 8-5, the solutions 
were left unstirred for the desired length of time. The results are shown in 
fig. 6. There is a very slight decrease in the D values of Zn and Cd, which can 
be attributed to a reorganization of the iron hydroxide. 


logg D 
| xp: . of Zn 
E E o o o o 
15 ° 
15 
10 f f | Cd 
$ 


-10 0 7:0 20 
— logo T (hours) 


Fig. 6. Logio D for Zn, Cd and Hg in KNO; (107? N) as a function of time; pH = 8:5; 
metal-ion concentrations (mol1~+): Fe = 1.73.107*; Zn = 75.1075; Cd = 72.1077; 
Hg = 14. 1076. ; 


10. Conclusions 


It is not possible to propose at this stage a simple interpretation and descrip- 
tion of the coprecipitation mechanism which could be employed in trace- 
element separations. The ion-exchange mechanisms as described in secs 
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2.1.1-2.1.3 are thought to be the most likely reactions. The partial success of 
the Freundlich or Langmuir isotherms is not a sufficient proof that coprecipita- 
tion proceeds via ion-pair or molecular adsorption (secs 2.1.4 and 2:1.5): in 
certain concentration regions these two isotherms can fairly well approximate 
the more general mass-law equation. 

It is not likely that mixed-crystal formation, occlusion and entrapment 
(secs 2.2.1 and 2.2.2) play an important role: the strong pH dependence of D 
excludes such a mechanism almost entirely. Further evidence comes from the 
observation by Kolthoff et al.?*) that practically the same distribution ratios 
are observed when the metal is added after the iron-hydroxide precipitate has 
been formed. 


Eindhoven, September 1975 
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INTRODUCTION AND REMOVAL OF 
HYDROXYL GROUPS IN VITREOUS SILICA 


PART II. CHEMICAL AND PHYSICAL SOLUBILITY 
OF HYDROGEN IN VITREOUS SILICA 


by G. H. A. M. van der STEEN and E. PAPANIKOLAU 


Abstract 


Hydrogen dissolves in vitreous silica at elevated temperatures accom- 
panied by the formation of =Si-OH groups and an equimolar amount 
of other H-containing centres. Below 600 °C the kinetics of the reverse 
reaction is very slow and the hydroxy] concentration does not decrease 
during heat treatment. Physically dissolved hydrogen is easily removable 
at 450 °C, and the evolution of hydrogen obeys the laws of normal dif- 
fusion. Above 600 °C, the evolution of hydrogen is linearly proportional 
to the decrease of the OH concentration. Small amounts of alkali and 
alkaline-earth-metal ions may considerably influence the chemically 
bonded state of the hydrogen present in the vitreous silica. 


1. Introduction 


In general the concentration of hydroxyl groups in vitreous silica can be 


estimated by making use of the extinction coefficient (£o) and assuming the 
validity of the Beer-Lambert law: 


10 Bou M, 
[OH] = 10 Pon Mow g OH/g of glass, 


€ou Osi 


where [OH] — hydroxyl concentration (g OH/g of glass), 

Bou = optical density per mm (mm^), 

Mou = molecular weight of a hydroxyl group (g mole^?), 

&og = practical extinction coefficient (litre mole^! cm^ !), 

Qo = density of the glass (g litre" +). 
In general the absorption band of the unassociated hydroxyl vibration occurs 
at about 3700 cm^ !. 

Stephenson and Jack +) determined £o by studying the weight lost by a 
Spectrosil sample during firing in vacuo at 1000 ?C. The loss in weight, assumed 
to be water, was related to the optical density. Two experiments gave respective- 
ly 78 and 77 1 mole^! cm^! for oj. In the literature the extinction coefficient 
for water or for hydroxyl groups is used more or less arbitrarily. To avoid 
confusion, the authors will indicate in the course of the paper whether the 
extinction coefficient refers to —OH groups (eon) or to water (€4,0). Hydroxyl 
groups can be introduced in vitreous silica by melting the quartz powder in a 
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hydrogen-containing atmosphere. This has been shown in a previous publica- 
tion ?). It is shown there also that the state of oxidation of the underlying glass 
network determines whether H, or H,O is evolved during heating in vacuo. 
Because of the relatively high diffusion coefficient of hydrogen in vitreous 
silica, it is quickly removed from the glass sample as the reaction proceeds. 
The removal of H,O from the glass is a slow process compared with H, evolu- 
tion, probably due to the small diffusivity of water in vitreous silica. At high 
temperatures (> 1500 °C) hydrogen formation, due to decomposing OH groups 
may even give rise to bubble formation. 

The construction of an extraction apparatus suitable for determining very 
small amounts of hydrogen makes it possible to study the relation between the 
ABou value and the hydrogen concentration and thus to determine the “practi- 
cal" molar extinction coefficient of hydroxyl in vitreous silica. In this paper it is 
also demonstrated that small amounts of alkali and alkaline-earth metals have 
a great influence on the chemical behaviour of hydrogen in the glass. 


2. Preparation of the vitreous-silica samples 


2.1. Melting procedure 


The raw material (rinsed and milled Brazilian rock quartz) was melted in a 
molybdenum crucible in a hydrogen atmosphere at 1950 °C by a high-frequency 
technique. À more detailed description of the melting apparatus, raw material 
and melting procedures has been given in a previous publication ?). The 
vitreous-silica ingots were polished bilaterally, so that a thin plate, suitable for 
absorption and extraction measurements, remained. 


2.2. Barium addition 


Barium was added to the crystalline quartz powder in the form of a solution 
of its nitrate. In order to spread the solution as homogeneously as possible over 
the quartz powder and also to prevent the formation of concentration gradients 
during drying, such an amount of solution was added that the surface of the 
crystals was wetted but no pools were formed (about 0:5 cm? solution/g of 
quartz). After drying at 200 °C, the mixture was stirred in an agate mortar and 
fired for one hour at 1050 °C in an oxygen atmosphere. The mixture was melted 
in the same way as an undoped sample. 

The Ba content (after melting) was determined with the aid of atomic-absorp- 
tion analysis. 


2.3. Equipment for high-temperature treatments 


Some samples were fired in a vertical high-temperature tube furnace 
(PCA 10/10 (1800°C) Lab. Furnace) at temperatures between 1100 and 
1600 °C in an H;/N; atmosphere. 
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The samples were quenched in the same atmosphere as they were heated in. 
The glass plates were placed in a molybdenum crucible and the temperature 
was measured by means of a pyrometer. Samples with varying hydroxyl con- 
tent can be produced by varying the partial pressure of the hydrogen and/or 
changing the heating temperature. 

After the heat treatment the surface of the vitreous-silica plates had generally 
recrystallized. After repolishing, the IR-absorption measurement and extraction 
experiment took place. 


3. Apparatus for extraction experiments 


The extraction experiments took place in an apparatus as described schemati- 
cally in fig. 1. After IR-absorption measurement the glass plate was treated with 
a concentrated HF solution, rinsed with distilled water, and dried at 50 ?C. 
This treatment was necessary in order to remove surface contaminants. After 
drying, the sample was placed in a vitreous-silica tube A which was sealed to 
the system. 

The sample could be moved in the system by means of two magnets B1 and 
B2. The magnet inside the system was mounted in a small evacuated vitreous- 
silica capsule. Prior to the start of the experiment, the sample was pushed into 
the cold zone of tube A, oven C was heated up (1000 °C), and the system was 
evacuated by means of a mercury-diffusion pump D until a pressure of 
107? Torr was reached. Valve E was then turned off. The very slow decay of 
the high-vacuum meter indicated that leakage from the system was very small. 
The oven was now cooled down and the sample pushed to the centre of the 
oven. The temperature was raised again and the extraction experiment started. 
By means of a Toeppler pump, F, the gases were concentrated in volume G. 
A float in volume G prevented the concentrated gases as well as a small amount 
of mercury from flowing back into the Toeppler pump. The pump concentrated 
the gas at a speed of about two strokes a minute. Concentrating the gases was 
performed automatically by detection of the mercury level in tube H with a 
light source and a photoconductivity cell. During every stroke about 50% of 
the gases present in the system was transported to volume G. During the last 
stroke the mercury level was raised to the top of capillary I. Accordingly the 
gas was forced into a channel drilled in valve J. By turning this valve, the 
channel was connected with the circuit of a gas chromatograph. 

The gas chromatograph was provided with a column about 7 metres in 
length, filled with molecular sieves 13 X (30-50 meshes). The column tem- 
perature was kept constant at 50?C, while the carrier gas (Ar) flowed at a 
speed of 30 ml/min. At the end of the column a catharometer (type Gow- Mac 
10-285 WX) was mounted. The detection limit for the system described above 
was 1077 cm? (N.T.P.) H3. 
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Outlet 


Fig. 1. Apparatus for extraction experiment. 


A = vitreous-silica tube, 1. to evacuate the channel, 
B1, B2 = magnets, 2. to fill the channel with the extracted 
C = oven, gas, 
D = mercury-diffusion pump, 3. to connect the channel with the circuit 
E = valve, of the gas chromatograph, 
F = Toeppler pump, K = N;-cooled trap 
G = glass capsule provided with a L = float, 
float (L), M = mercury, 
H = glass tube N = light source and photoconductivity cell 
I = capillary, (connected with P), 
J — valve, with a channel drilled in it. O = sample, : ^ 
This valve can occupy 3 positions: P - magnetic stopcocks connected with N. 


To operate the apparatus in practice, more valves and high-vacuum meters are connected 
to the system. 


4. Equipment for absorption measurements 


The infrared-absorption measurements were carried out with a Jasco IRA-2 
spectrophotometer (4000-400 cm^7 +). For measurements in the near-ultraviolet 
and visible-light regions (50000-4000 cm-+) a Beckmann DK-2 was used. Dif- 
ference spectra were recorded by placing one of the samples in the reference 

! beam of the spectrophotometer. 
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5. Results and discussion 


5.1. IR-absorption measurements 


Faile and Roy ?) introduced large amounts of OH groups into vitreous silica 
by means of neutron irradiation in a hydrogen-containing atmosphere. Their 
experiments indicated that when the OH concentration in the sample is very 
great, the absorption spectrum is extended by two bands (2250 and 4520 cm^ !). 
The first band was attributed to the fundamental Si-H stretching vibration and 
the other to its first overtone. 

It is known from “Silane” chemistry that the fundamental Si-H stretching 
vibration occurs as a rule at frequencies between 2300 and 2100 cm^!, with 
corresponding overtones at between 4600 and 4200 cm" *. Si-H bonds are also 
presumed to occur in silicon-device technology, where thin SiO; layers are 
grown at the silicon surface by thermal or anodical oxidation ^). 

In our preparation technique it is equally possible that Si-H bonds are 
formed: 


2 SiO, + H, — =Si-H + zSi-OH, (1) 
where 


T 
= SiOH :: Si0,s0H:: $-Si-oH. 


€ 


The OH concentration in our glass, however, is some orders of magnitude 
lower than the concentration in the irradiated glasses that were produced by 
Faile and Roy. For accurate measurements it is necessary to have samples with 
a relatively high Si-H concentration or very thick samples. The preparation of 
thick, bubble-free samples is hampered by the fact that quenching becomes less 
effective and bubbles, filled with hydrogen, are formed during cooling. 

We succeeded in preparing a thick, practically bubble-free sample by melting 
the quartz powder in a 25% H,-75% He mixture. After cutting and polishing, 
a plate of about 14-4 mm thickness remained. One of the two halves of the 
plate was fired for 60 hours at 1050 °C in vacuo. 

Figure 2 gives the absorption spectra of both samples as well as the dif- 
ference spectrum. 

An absorption band of a low intensity was recorded at 4535 cm^! (untreated 
sample). Unfortunately, vitreous silica itself has some very strong absorption 
bands in the 2200-cm^! region. This probably explains why we were unable 
to record an absorption band at 2250 cm7?. 

We have not yet been able to decide whether the 4535-cm™+ absorption band 
should be attributed to an Si-H vibration or to a vibration of an asymmetric 
tetrahedron containing a hydroxyl group (as proposed by Adams 5)), or to both. 


^ 
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Fig. 2. Absorption spectra of (a): vitreous silica melted in a 25% H5-75 7; He atmosphere; 
(b) sample (a) fired for 60h at 1050 ?C in vacuum; (c) difference spectrum ((a) — (b)). 
Thickness of the samples 14-4 mm. 


5.2. Separation of physically dissolved and chemically bonded hydrogen by means 
of extraction 


The effect of temperature treatment on the Boy value of a freshly melted 
sample can be described as follows. When a freshly melted vitreous-silica sample 
with a Bou value of 70. 107? mm^! was fired for 1000 hours in vacuo at 
450 °C, the Box value did not decrease detectably during this treatment. When 
the sample was fired at 750 ?C for the same period of time a slow decrease of 
the Box value was observed. At 1000 °C the Boy value was reduced to about 
1.107? mm^! within a few hours. 

When a similar sample was extracted in an apparatus as described in sec. 3, 
the results were as follows. At 450°C the sample released 1:97 .107? cm? 
(N.T.P.) H5/g of glass within 24 hours. After 24 hours, hydrogen evolution 
had stopped almost completely. When the temperature was raised to 700 °C, 
hydrogen development restarted slowly. At 1000 ?C the evolution of hydrogen 
was very fast during the first hours of extraction. Between 600 and 1000 °C, 
12:8. 107? cm? (N.T.P.) H;/g of glass was released and the Boy value went 
to zero. 

The observed qualitative effects of firing on H, evolution and on the Boy value 
are given in table I. These results confirm the assumption that at lower tem- 
peratures the chemical interaction between the glass network and the diffusing 
hydrogen molecules is negligibly small. 

If there is no appreciable interaction between the physically dissolved H, and 
the glass network, the evolution of gas must obey the laws of normal diffusion. 
As a first approximation the small flat plates (thickness of the plate about 
3 mm, diameter 20 mm) were taken to be a semi-infinite plane sheet. The 
solution of Fick's equation for this system then becomes: 
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TABLE I 


Qualitative effect of heat treatment on hydrogen development and on the 
Box value of an undoped and a Ba-doped vitreous-silica sample 


- temperature 


Co 


evolution of H, by | Boy value 


undoped Ba-doped undoped Ba-doped 
sample sample sample sample 
450 fast fast 


stops after stops after 
24 hours 24 hours no change no change 


500 none none no change no change 


600 very slow fast very slow decreases at 
stops after decrease the same rate 
130 hours as H, devel- 
opment, until 
zero 


700 decreases at | remains zero 
the same rate 
as H, de- 
velopment 
750 remains zero 
1000 remains zero 


h Sx 
1 8 1 2 1 2 
e [edax o X arl) D 1 (2) 
h n? Qn + 1)? h 
n=0 


where € = mean concentration, 
Co = initial concentration, 
h = thickness of the sample, 
D = diffusion coefficient, 
t = time, 
x = distance (0 < x< h). 


A more detailed treatment of this mathematical problem is given in the publica- 
tions by Jost ) and Crank ?). 

With the aid of computer techniques the extraction data of the sample can 
be compared with the calculated values and the diffusion coefficient adapted 
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T ick's law 
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0-0 05 1-0 
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Fig. 3. Typical outdiffusion curves. Firing temperature 450 °C. x Vitreous silica, D = 
4.31.1077 cm?s~1, h = 0-221 cm. O Vitreous silica, 0-1 mole% BaO, D = 3:34. 107? 
cm? s7 1, h = 0:230 cm. Calculated Fick’s-law curve (drawn) is shown for comparison. 


until the theoretical curve shows an optimum fit with the experiment (cf. 
fig. 3). The diffusion coefficient corresponding to the best-fitting curve is 
4:31 . 1077 cm? s7!, a value which is quite well comparable with the results 
of Lee 14) (D45o*c = 4°54. 1077 cm? s7 1). 

Most probably the concentration of physically dissolved hydrogen in the 
samples is not equal to the saturation hydrogen concentration at 1 atmosphere. 
During the finite cooling time, hydrogen will be formed as a result of decom- 
posing hydroxyl groups. As shown in an earlier publication ?), slow cooling 
may even give rise to bubble formation. The pressure in these bubbles may 
exceed the atmospheric pressure considerably (up to 10 atmospheres in a 
bubble with a diameter of about 200 um). 

A sample with a different thermal history was prepared as follows. A freshly 
melted sample was reheated for 16 hours at 1550 ?C in a hydrogen atmosphere 
and quenched. As a result of this treatment the Boy value was reduced to 
43 . 107? mm^!. 

When this sample was extracted at 450 °C, 0:71 . 107? cm? (N.T.P.) H; per 
g of glass was evolved. 

In our opinion these experiments indicate that the concentration of physically 
dissolved hydrogen is strongly dependent on the thermal history of the samples. 


5.3. The practical molar extinction coefficient for OH. groups in vitreous silica 


When the physically dissolved hydrogen is removed from the samples at 
450 °C, the chemically bonded modification will still be present. Samples of a 
series having different Bo, values were extracted at temperatures between 600 
and 1000 ?C and the results are given in fig. 4. The straight line indicates that 
Eon is independent of the concentration. 

In order to calculate the molar extinction coefficient for OH groups from’ 
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Fig. 4. Extracted amounts of chemically bonded hydrogen (in the form of Si-OH and Si-H 
groups) as a function of the Boy value. Before plotting, the amount of physically dissolved 
hydrogen (extracted at 450°C) is subtracted from the total amount. Point e refers to a Ba- 
doped sample, extracted at 600 ?C. 


this figure, some additional information is necessary, for it is important whether 
one H, molecule produces one or two hydroxyl groups. For instance 
Sitt 2 O?~ + 4 H, — Si?*O?-OH (suggested by Bell 9) (3) 
xn 
reduced silica 
with hydroxyl 
or ?) 
3 SiO, + H; — 2 =SiOH + SiO (4) 
MÀ M -— 
dissolved 


are two reaction mechanisms where one H; molecule produces two OH groups. 
Another possibility is the following mechanism: 


=Si-O-Si= + H, > =SiOH + H-Si=, (5) 


where one H, molecule forms one OH group and one Si-H group. Assuming 
that the OH groups are formed according to reaction (3) or (4), the extinction 
coefficient will come to £oj, = 29-0 1 mole~1 cm~ +, whereas, if the OH groups 
are formed according to reaction (5) the extinction coefficient will be twice as 
high, €94 = 58:0 1 mole‘ cm^!. The latter value is not too far from the ex- 
tinction coefficient determined by Stephenson !) egy = 77:5 1 mole^! cm !. 
In the authors' opinion there exists a strong indication that equimolar amounts 
of Si-OH- and Si-H-type groups are formed by reaction (5). Assuming that 
this reaction mechanism is valid, some phenomena can easily be explained, viz.: 
(a) The relationship 
K Pa, = [=SiOH] [=SiH] = [OH]. 


is consistent with the experimental results °). 
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(b) In silicon-device technology thin vitreous-silica films can be grown on silicon 
plates by for example thermal oxidation. Beckmann *) measured the infra- 
red-absorption spectra of these films by internal-reflection spectroscopy and 
attributed absorption bands at 2250 and 2350 cm-^! to the fundamental 
Si-H .. O and to Si-H stretching vibration respectively. 

(c) The shape of the etchant count profiles for SiO, films formed by thermal 
oxidation with titrated water 1°) have been explained by Breed ++), making 
use of this reaction mechanism. 

(d) The results of recent H-D-exchange experiments support the assumption 
that considerable amoünts of hydrogen must be present in the glass in a 
form other than hydroxyl groups ?). 

In our previous paper?) it was assumed that the reaction products formed 

within a bubble by the reaction 


SiO; + H, — SiO + HO 
gas gas gas 


were dissolved in the network according to 


2 SiO, + SiO + H,O > 2 zSiOH + SiO; 
gas X gas diss. diss. 


if this process is followed by some rearranging reaction: 


2 =SiOH + SiO > =SiOH + =SiH + SiO, , 
diss. diss. diss. diss. 


the net result will be the same as for the reaction (5). 
Another possibility is the formation of a strong complex between one 

hydroxyl group and the Si?+-containing centre: 

=SiOH SiO + =SiOH. 

eT 
It is known that strong complex hydrogen bridges can shift the absorption 
bands to lower frequencies. If the complex is strong, the absorption band may 
even shift to ffequencies where the glass itself has some very strong absorption 
bands. As a consequence of this, only half of the OH groups would be visible 
at 3700 cm-t. The difference between an Si-H group and a strong complex 
may be a gradual one. Because of the low dielectric constant of the glass (as 
compared with water), complex formation is quite possible even at high tem- 
perature. : 


5.4. Vitreous silica with BaO dopes 


Scholze !?) and Gótz !?) studied the influence of the addition of glass modi- 
fiers in soda-lime-silicate glasses on the behaviour of the OH absorption bands 
in the 3300-cm^ ! region. To study the influence of modifiers on the behaviour 
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of H, in vitreous silica, we prepared samples in the manner described in sec. 2.2. 
Experiments showed that the effect of the addition of alkali and alkaline-earth- 
metal ions was roughly the same. We chose Ba-doped samples for a more 
extensive study, because Ba addition seemed to be fairly well reproducible 
(low evaporation loss). The Boy value of the Ba-doped samples seems to be 
strongly dependent on the Ba concentration. In fig. 5 the Bo, value is given 
as a function of the Ba concentration. 


-l 


——> figy value .10? (mm) 


0 0-05 0-10 0-15 020 
—— Ba concentration (mole 9^) 


Fig. 5. on value as a function of the Ba concentration in vitreous-silica samples. The samples 
were melted for 15 min at 1950?C in a hydrogen atmosphere. 


In the case of low Ba concentrations the effect on the fo value is particularly 
striking. Except for the absorption bands due to-OH and Si-H vibrations, the 
spectrum of a doped sample looks similar to the spectrum of an undoped 
sample. The difference spectrum of a doped and an undoped sample does not 
show any additional absorption bands either. The extraction data, however, 
differ markedly from the data for undoped samples. 

When a freshly melted sample containing 0-1 mole% BaO was extracted at 
450 °C, 1-38. 107? cm? (N.T.P.) H;/g of glass was evolved. This amount is 
comparable with the amount that is released from undoped samples. The 
diffusion coefficient is slightly lower than the diffusion coefficient of H, in an 
undoped sample (D450°c = 3:34. 1077 em?s-! and Dagsoe¢ = 437.1077 
cm? s^, respectively) (cf. fig. 3). 

A second experiment, carried out with a sample of about half the thickness, 
gave the same result. In contrast with the undoped samples, H, development 
starts again at 600 °C, while the Boy value decreases simultaneously (Afoun = 
12.107? mm^ i, 218. 107? cm? (N.T.P.) H;/g of glass). This amount of H, 
evolved at 600°C seems to be quite well predictable with the aid of the 
curve in fig. 4 (the experimental point has been added in that figure). 
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When the sample is “empty” with respect to physically dissolved H, and 
-OH groups, H, development stops. When, however, the temperature is raised 
to 1000 °C, H, development starts again (5-85. 107? cm? (N.T.P.) H,/g of 
glass). 

This leads to the conclusion that there must be some other hydrogen-con- 
taining centre, rather than the normal -OH and Si-H groups in the glass, 
which cannot be visualized with the normal IR-absorption techniques. In 
0-1-mole %-doped samples most of the hydrogen is in fact present in this un- 
known chemically bonded state (table IT). The qualitative effects of heat treat- 
ment on the hydrogen development and the Boy, value are given in table I. 


TABLE II 


Extraction data for a 0-1-mole 7; BaO-doped vitreous-silica sample 


amount of H, 
extracted in 


cm? (N.T.P.)/g 
of glass 


temperature 


CO 


remarks 


Bou value remains 
12.107? mm=? 
Box value decreases 
until zero 
Bou value remains 
zero 


600 218 . 107? 


700-1000 5:85. 107? 


6. Discussion and conclusions 


(1) Physically dissolved hydrogen can be removed separately from the chemi- 
cally bonded hydrogen by firing the vitreous silica at 450 °C in vacuo. 

(2) Hydrogen development at 450 °C obeys the laws of normal diffusion with 
Da4so*c = 437. 10-7 cm? s~?. 

(3) The amount of physically dissolved hydrogen present in a sample is strongly 

dependent on the thermal history of the sample. During cooling (or quench- 
ing) an unknown amount of OH groups and Si-H groups will decompose 
and form H,. 
It may be possible to study the solubility as well as the diffusivity of H, 
in vitreous silica at temperatures below 550 °C by making use of the ex- 
traction technique described. Above 550°C the data are clouded by a 
chemical interaction between the diffusing H, molecules and the SiO, net- 
work. 

(4) When a sample, which is “empty” with respect to physically dissolved H,, 
is heated up to 600°C, H, development restarts very slowly, while the 
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Box value decreases simultaneously. In the temperature range between 750 
and 1000 °C, H, development is very fast, and the Boy value decreases at 
the same rate. This leads to the conclusion that decomposing -OH (and 
Si-H) groups act as H, sources. A value of 58 1 mole^! cm^! is found 

for £o. . 

- (5) In BaO-doped samples three types of hydrogen can be distinguished. 

(a) Physically dissolved hydrogen, which leaves the sample at 450 *C, obey- 
ing the laws of normal diffusion with D459«c = 3:34. 107" cm? s^ !. 
The diffusion coefficient is somewhat lower than the diffusion coefficient 
of H, in an undoped sample. 

(b) Chemically bonded H, in the form of Si-OH and.Si-H groups. The 
extinction coefficient for OH groups is the same as the extinction coeffi- 
cient for OH groups in undoped samples. The thermal stability of the 
OH and Si-H groups is affected by BaO addition. 

(c) Most of the hydrogen is bonded in a more stable form. So far it has 
not been possible to visualize these H;-containing centres with the aid 
of normal IR-absorption measurements. 

(6) The effect of BaO addition on the absorption band at 3700 cm~? is much 
stronger than the effects measured by Scholze +°) and Götz !?) in soda- 
lime-silicate glasses. The formation of a new absorption band at about 
2800 cm~ + could not be detected either. One of the possible ways of explain- 
ing this is to assume that strong hydrogen bridges are formed, so that the 
frequency for OH stretching shifts to the 1600-cm^! range. 

In general, it is no use trying to calculate the total H, (or H20) content of 

vitreous silica with the aid of the OH concentration, measured by means 

of IR absorption, without knowing the other factors. 

The H concentration in vitreous silica is made up of the following terms: 


10 fo, 10 f. 
[H] —2x = + (1 — x) a + [A pnys. diss. + 2 [HzO] nys. diss. + 


Ost ou Og! ou 


-HB] chem, bonded other than =SiOH and =SiH 


wheré x is the fraction of -OH groups that is compensated by an equal 

amount of Si-H groups. 

(a) x can be estimated by firing the sample at 1050 °C in vacuo for 24 hours 
and measuring the Boy value. 

(b) [H]pnys. ais, may vary considerably and is determined mainly by the 
thermal history during cooling. Extraction of the H, at 450°C gives 
the most direct information about the concentration. 

(c) [HzO] nys. aiss. is probably negligibly small. 

(d) [H] chem. bonded, ete. This concentration is strongly dependent on the 
alkali content and may account for up to about 80% of the total 
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hydrogen content. Hence it may in some cases be the most important 
H, source. 

(8) The anomalies of the input and decay portions of the permeation data 
found by Lee !*15) can probably be explained if the reduction reaction 
between H, and SiO, is assumed. In the temperature range in which Lee 
found the anomalies (> 600°C), the amount of OH groups formed due 
to this reaction is too small to measure with the aid of IR-absorption 
techniques. The much more sensitive extraction experiments show the pres- 
ence of chemically bonded hydrogen very clearly. The anomalies in the 
results of the H-D-exchange experiments, carried out by the same author, 
are probably attributable to the presence of Si-H groups that participate 
in the exchange reactions. The equilibrium constant for the reaction 
2SiO, + H, = =SiOH + (^ SiH") seems to be strongly temperature- 
dependent. We intend to publish the results of these investigations in the 
near future. 
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ETCHING OF A1,03 IN AN HC1-H, 
AMBIENT AT 1200-1500 K 


by J. BLOEM and Bieneke BOSMAN 


Abstract 


The stability of A120, in an ambient containing halides has been studied 
for the case of gaseous HCl in hydrogen at 1 atm total pressure and at 
temperatures of 1200-1500 K. A rapid attack is noted and it is con- 
cluded from the experiments that AICI, is the main reaction product. 
Addition of H;O to the gas mixture strongly inhibits the etching reac- 
tion for H20 concentrations above a certain level which is determined 
by the concentration of water already present, as H2O is produced by 
the etching reaction itself. The reaction is diffusion-controlled and the 
etch rate is the same for single-crystalline sapphire and polycrystalline 
sintered alumina. A more accurate value of the enthalpy of formation 
of AICI, is given, AH?,,58(AICl;, g) = —88 + 3 kcal/mole, against 
—75 + 20 kcal/mole obtained before. This means that a reasonably 
accurate value of the vapour pressure of AlCl, can be given as a 
function of temperature for various HCl partial pressures in equi- 
librium with solid A1203. 


1. Introduction 


Etching of Al;O; has been studied mainly at nearly room temperature for 
the detection of dislocations '-?*) or the preparation of damage-free surfaces 
for silicon epitaxy on sapphire 45-6). 

Gas-phase etching of sapphire has been reported by Manasevit and co- 
workers 78-9?) who studied the reaction of sulphur fluorides and fluorinated 
hydrocarbons on alumina. In the growth of silicon on sapphire it was observed 
that best results were obtained by SiH,, the use of Si halides resulting in attack 
on the. substrate by the halide 19:17). However, the etching of Al,O, by HCl 
has not been reported. This system has now been chosen as an example of 
high-temperature corrosion of refractory oxides. 


2. Experimental 


The etching experiments were performed in a water-cooled fused-silica epi- 
taxial reactor tube in which a graphite susceptor is placed. This susceptor is 
RF-heated to the required temperature (1200-1500 K). A calibrated optical 
pyrometer is used to determine the surface temperature of the susceptor. 

Polished slices of sapphire, (1012) orientation, densily sintered alumina 
(grain size ~ 20 wm) and hot-pressed alumina (grain size e 1 um) are placed 
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RF coil 


o o o o o o o o o [*] o 


eo 
Quartz tube 


Stagnant layer 


LLL LLL LL B, , LL, LL LL In 
Alo 03 slices 


Fig. 1. Apparatus used for the etching experiments. 


on top of the susceptor (fig. 1). The slices are 300 um thick, precleaned by 
heating in an H,SO,-H,O, solution, thoroughly rinsed in deionized water 
and dried. 

Hydrogen gas with an impurity content of 1 ppm H,O and O, is passed into 
the reactor at nearly atmospheric pressure. The gas speed at room temperature 
is 50 cm/s. Under these conditions a stable stagnant layer is formed above a 
heated susceptor 17). Diffusion of the reaction partners through this stagnant 
layer constitutes one of the steps in the etching process. 

The hydrogen gas entering the reactor is loaded with HCI gas, concentrations 
ranging from 0-10 volume percent. CO; can be added simultaneously, in order 
to obtain a desired H,O concentration. In the presence of CO, the reaction 
that takes place at the hot substrate surface 


CO, + H, 55 CO + HO (log K, (1500 K) = 0:656) (1) 


is shifted far to the right, and it can be safely assumed that the H,O concen- 
tration near the substrates is equal to the input concentration of CO,. The 
influence of an increased oxygen activity in the system can thus be studied. 

The etch rate of the alumina slices was determined by measuring the weight 
loss on etching as rounding off on the edges is not observed, the surface remains 
flat and hardly any attack at the rear of the slices was noticeable. The exposed 
surfaces are etched homogeneously with a slight preference for attack at grain 
boundaries. 


3. Experimental results 


Firstly it was ascertained that at the temperatures used there was no ap- 
preciable attack of the Al;O4 samples by the ambient gas, hydrogen. In hydro- 
gen (as also found by Zeveke et al.5)) the etch rate appeared to be smaller 
than 7.1074 mg/cm? min at 1500 K. This value is small compared to the 
etch rates in HCI ranging from 107? to 1 mg/cm? min as deduced from fig. 2. 
The weight change is found to be linear with time indicating a constant etch 
rate, i.e. no solid reaction products are interfering by blocking the reaction 
path. 
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——» Weight loss (ng/cm2) 


—» 1 (min) 


Fig. 2. Weight change of sintered alumina per cm? free surface on etching with HCl at 
1500 K. 


Figure 3 shows the etch rate R (mg Al,O3/cm? min) as a function of the 
concentration of HCl in the ambient H;. The slope of log R vs log pac is 
found to be 0-8 + 0-1. 

In fig. 4 the temperature dependence of the etch rate for puc; = 107? atm 
is depicted; the slope of log Rvsl/T indicates a reaction enthalpy of 
25-3 + 1-1 kcal/mole. No significant difference is found for the etch rates on 
different substrates. 

Introduction of H,O should decrease the etch rate; to this end CO, has 
been added to the gas mixture in concentrations up to 0-477. 


—+Log etch rate (mg/min cm?) 
| 


Ù 


=3 -2-5 -2 -7:5 - 
—> log Dci (alm) 


Fig. 3. Etch rate of sintered alumina as a function of HCI partial pressure at 1500 K with 
hydrogen as ambient gas. 
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o sapphire 
x sintered alumina 


— Log R (mg/cm? min) 


Fig. 4. Temperature dependence of the etch rate of Al2O3 for puc, = 107? atm in hydrogen. 
A least-squares analysis gives values of the reaction enthalpies of sapphire, sintered alumina 
and hot-pressed alumina as AH, = 24-2, 26:1 and 25-4 kcal/mole, respectively. 


Its influence on the etch rate is shown in fig. 5, again with a constant value 
of pyc, (107? atm). For CO, (H,0) concentrations exceeding 4-6. 107^ atm 
the etch rate begins to decrease with increasing values of pco,. The slope of 
log R vs log pco, is equal to —1:5 + 0:5. 

On the other hand, for a fixed value of water-vapour pressure the etch rate 
changes with pyc; as shown in fig. 6. The etch rate without added H,O as 
given in fig. 3 is included for comparison. It shows that the higher the value 
of Puc: the closer the two curves approach each other. The slope of the new 


—+log etch rate (mg/cm? min) 


_7 > log Peo; (atm) 


Fig. 5. The influence of the addition of CO, (or H20} on the etch rate of alumina in HCI 
(Puci = 1072 atm). 
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x sintered alumina 
A hot-pressed alumina 


—>Log etch rate (mg/cm? min) 


M -2 d 
—>Log Pyc; (atm) 


Fig. 6. Variation of the etch rate of alumina with pyc, with and without added water. Upper 
curve without added water, lower curve with Poo, (= Pn 20) =2-5.1073 atm, T = 1500 K. 


curve of log R vs log puci (with a fixed value of py,o) is not constant, being 
1-2 at the highest values of pyc, used. 


4. Discussion 


Etch rates 


In the etching of Al,03 with HCl only gaseous reaction products are formed; 
the various steps in the chain of reactions can then be given as follows: 

(a) diffusion of HCl to the Al;O; surface (gas-phase diffusion), 

(b) adsorption of HCl on the A1;0, surface, 

(c) surface diffusion of HCI to active surface sites, 

(d) formation of reaction products, 

(e) surface diffusion of reaction products, 

(f) desorption of reaction products, 

(g) diffusion of reaction products away from the surface. 
Steps (a) to (c) are not thought to be rate-limiting as only a small part of the 
HCI is used in the process. The temperatures are high enough to give rapid 
reaction and desorption rates (steps (d), (e) (f)). It is therefore likely that the 
last-mentioned step (g), the diffusion of reaction products away from the sur- 
face, is the rate-controlling step. This is in accordance with the observation 
that within the experimental error the etch rate is independent of orientation 
or grain size of the alumina crystallites. The consequence of this situation is 
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that near the reacting surface equilibrium concentrations of HCl and reaction 
products are present. 

In the epitaxial reactor a steep temperature gradient is present perpendicular 
to the heated surface. This temperature gradient generates a stable stagnant 
layer of a few mm in thickness above the susceptor +”). Diffusion of reaction 
products through this layer leads to an etch rate (R) proportional to the flux j 
of reaction products away from this surface, given by 


OPaici, 


Jaca, = Daca, M (mole/cm? s) (2) 


or approximately 


PAtici, (surface) — Paici,(sas) 


j= Daci, ™m $ 
where m is the total gas density (mol/cm?), 6 is the thickness of the stagnant 
layer (cm), Dici, is the diffusion coefficient of the species AICI, in hydrogen 
(cm?/s) and pac, is the partial pressure of this species; x is the direction per- 
pendicular to the surface. As paic, (gas) is nearly zero, the flux and, therefore, 
the etch rate will be directly proportional to p4;ci, near the surface, i.e. the 
equilibrium vapour pressure of reaction products in the surface area. 


5. Possible reaction products 


According to the data given in the JANAF thermochemical tables !?) pos- 
sible reaction products are AlCl, AICl,, AlCl, and AIOCI according to the 
following reactions: 


log K, (1500 K) 
Al;0, + 2 HCI + 2 H; 552A1Cl +3H,0 —19-6, (3a) 
Al,O,; +4HCI + H, $ 2AlICl, +3H,0 —13:8, (3b) 
Al,O3 + 6 HCl $5 2 AlCl, +3H,0 —10°6, (3c) 
A1,0, + 2 HCl 4 2Al0C1+ H,O —18:5. (3d) 


The values of K, from ref. 13 at 1500 K are given. For example K, for 
reaction (3b) is given by 


= (Paci » s (2,9)? 


4 
(Puci)* Pn; S 


p 


In the present case py, = 1 and from eq. (3b) it follows that 


—3 
Pazo = = Pai 
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Introducing this into eq. (4) leads to 


Paiciz = (sy? Ko (Puci) ^^. (5) 


In the same way the partial pressures of the other reaction products can be 
calculated as a function of pyc, and temperature, giving 


Pac = (4)? Ke (Puc)?^, 
Paiciz = (39/5 Ke!" (Puci) $^, 


Paioci = 2 P d (Puci)??. 


6. Analysis of experimental results 


(i) From the foregoing it is concluded that the etch rate is proportional to the 
equilibrium vapour pressure of the main reaction product. According to 
eq. (5) for AICI, as main reaction product the etch rate should vary with 
Puc: a R c puat and the slope of log Rvs log pyc, is expected to 
be 4/5. For AICI, AICI,, AlCl, and AIOCI the expected slopes of 
log R vs log puc, are 2/5, 4/5, 6/5 and 2/3 respectively. This is tabulated 
in table I. 

(ii) From the JANAF tables the value of the enthalpy of the reactions (3a) 
to (3d) can be extracted. The temperature dependence of the etch rate is 
given by the temperature dependence of the partial pressure of the reaction 
products, this being 4/5 for reactions a, b and c and 4 A7/3 for reaction d. 


TABLE I 


Calculated and experimental effects of the etching of Al;O4 with HCl in H, 
for AICI, AICl;, AlCl, and AIOCI as reaction products 


reaction log R vs log R vs log R vs AHg 
product log Puci log Puzo log Puci (kcal/mole) 
| (constant | (constant 
Puci) Puzo) 
AICI 46:4 + 0-4 
AICI, 30-7 + 8 
AlCl; 14-4 + 0-7 
AIOCI 71:3 + 3:3 
exp. 253 + l1 
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These values are also indicated in table I together with the experimental 
value. 

(iii) Introduction of H,O (via CO;) is expected to decrease the etch rate as 
H,O is one of the reaction products. For every etch rate a certain partial 
pressure of HO will be formed at the surface, the value can be calculated 
in the same manner as that of pci, (eq. (4)). A decrease in etch rate can 
only be expected if the concentration of additional water vapour exceeds 
this equilibrium value (compare fig. 5). It can be deduced from eqs (3a) 
to (3d) that for a constant value of pyc, the slope of log R vs log puo 
should be —3/2 in the case of reactions (3a), (3b) and (3c); the slope will 
be —1/2 for reaction (3d) (AIOCI as main reaction product, c.f. table I). 

(iv) Similarly a change in pyc, for a constant value of added H,O will give 
slopes of log R vs log puc, as indicated in table I. 

In the experimental results given in figs 5 and 6 the lowest etch rates (at the 
lower values of paci) are of the same order as those found for the reference 
slices in pure hydrogen (7. 1074 mg/cm? min). It can be expected, however, 
that during etching in hydrogen the introduction of H,O will also decrease 
the etch rate, so that interference in the present experiments is not likely. 

It is seen from table I that AICI, will probably be the main reaction product, 
the slope of log R vs log puc; of 1-2 being seen as an intermediate value be- 
tween the slope without additional water (0-8) and with a sufficient surplus of 
water to give the full value of 2. 

Thermochemical data of AICI, AlCl, and AIOCI are known to a fairly high 
degree of accuracy. Values of AICI, however, are interpolated between those 
of AICI and AICI4. Under the experimental conditions in discussion here, this 
gives nearly equal concentrations of AICI, AlCl, and AICI, at the surface of 
A1,0,; the value of Paroc, however, is much lower (see table II). 

The experimental data point to a greater prominence of AICl;. The tem- 
perature dependence of the etch rate gives an activation energy of 25:3 + 1-1 
kcal/mole, giving a value of 4H for reaction (3b) as 127 + 6 kcal/mole, against 
a value of 154 + 40 kcal/mole according to the JANAF estimate. Given the 
good accuracy of all other constants for the reaction partners in eq. (3b), the 
difference will be caused by a different JH, value of AlCl, at 1500 K, i.e. 
—93 +3 kcal/mole against —80 + 20 kcal/mole (4H9,,, = 88 + 3 kcal/ 
mole). This difference points to an AlCl, molecule that is much more stable 
than expected from interpolation between AICI and AICI;. 

Farber and Harris 14) found for the reaction 


AlCls(g) + Ald) 5 2 AICI,(g) 


a value of AH, = 9-9 + 9-0 kcal/mole at 298 K from a third-law analysis and 
a second-law value of —35-7 kcal/mole. The large deviation suggests temper- 
ature-dependent errors 55). The value of AH;°(AlCl,, g) adopted here leads 
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to AH, = +5 kcal/mole at 298 K and —14-7 at 1500 K for the reaction studied 
by Farber and Harris. 

From the present data we derive a value of log K, for reaction (3b) at 
1500 K of —9-8, giving and equilibrium value of 


Paici, = 2:2. 1074 atm at 1500 K and puc, = 107? atm. (6) 
For comparison the various vapour pressures are collected in table II, including 
the value of p4;c;, derived from the present experiments. . 
TABLE II 


Partial pressures of some gaseous Al-Cl compounds at 1500 K in equilibrium 
with solid Al,O3 in hydrogen loaded with HCI and H,O (present data for 
AICI, are given in parentheses) 


Puc: = 107* atm 
Pazo = 107^ atm 


log PAici —78 
log Paici; —6-9 (—4-9) —8-9 (—6-9) 
log Paicis —83 —11:3 
log Patoci —10:3 —11:3 


Two other methods are available for evaluating the vapour pressure of AICI,: 
firstly the etch rate can be analysed to give an approximate value of Pici, !?:1€), 
secondly the break in the curve of fig. 5 gives evidence of the partial pressures 
at the etching surface. 

The etch rate of Al;O; can be calculated from the flux of reaction products 
away from the reacting surface +216) giving 


Tu 2 Daci Paici2 5 — T, 2) (mole Jom? s) (7) 
ô R, To? In (TJ/T,) 


in which Dc, is the diffusion coefficient of AICI, in H, at room temperature, 
taken to be 0-25 cm?/s according to Shepherd +7), T, is the substrate tempera- 
ture — 1500 K, 7, is the gas temperature outside the stagnant layer, taken to 
be 1000 K, ô = 0-3 cm !?), R, is the gas constant = 82 atm cm?/mol degree 
and T, = 300 K. With the molecular weight of Al,O3 (M = 102), considering 
that two moles of AlCl, have to be transported per mole Al,0,, one obtains 
for the etch rate 
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R = 5.10? paci, (mg/cm? min). (8) 


In fig. 3 for pyc) = 107? atm the etch rate is found as 7. 107? mg/cm? min; 
with eq. (7) this leads to pA;c;, = 1:4. 107^ atm. This value is in reasonable 
agreement with the former estimate Paic, = 2:2. 1074 atm (eq. (6)). 

The other estimate is found from the experimental results depicted in fig. 5. 
The introduction of CO; leads to a reduction of the etch rate by the production 
of H,O at the alumina surface. The etch rate will decrease as soon as the addi- 
tional water-vapour pressure exceeds the equilibrium value of py,o. 

For puc; = 107? atm the equilibrium value of paici, is expected to be 
2.1074 atm, the equilibrium value of py,o is thus expected to be 3. 1074 atm 
(eq. (3b)). It is seen from fig. 5 that the break in the curve is present for a value 
of pco, of 4. 107^ to 6. 107* atm. 

The spread in the above estimates leads to a closer inspection of the equi- 
librium values of the reaction partners, e.g. for the reaction 


CO, + H; £5; CO + HO 


Pu,o at the surface is expected to be equal to the input value of pco, (= Pco; 
in the main gas). 

In the stagnant-layer model, however, CO; has to diffuse to the Al,O; sur- 
face, the flux of CO; being 
Pco2(s) — PCco»(surt) 


6 
In the limit of R = 0, the flux of H,O away from the surface is given by 


Jco; € Dco, 


Dus;0(surt) — ZH320(9) 
ó 


For the oxygen-mass balance in the steady state conservation of mass gives : 


Juo € Duzo 


2 ico, = ju;o + jco 
for the carbon balance 


Jco; md Jeo 
thus, by subtraction, we also have 
Jco; = ju;o- 
For Pco;«wr) 8nd Puzo) nearly equal to zero we obtain 
D CO2 


Pn30(surf) = Pcoz(* 


H20 
Instead of equal values of py,o and pco, an inequality is found, giving a lower 
value of py. because of the presence of the stagnant layer (Dco,/Du,0 © 2/3). 
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This effect could partly account for the difference found between the various 
estimates. 

The above-mentioned change in equilibrium concentration caused by different 
diffusion coefficients also has to be included in the other reactions. Moreover, 
the effect of thermodiffusion +6) has to be dealt with in order to arrive at a 
more complete description of the model. Additional experimental evidence is 
also needed to substantiate the implicit assumption that the reactions are 
heterogeneous surface-catalyzed reactions, so that reactions in the gas phase 
can be neglected. 


7. Conclusion 


(1) Etching of Al O, with HCI is a diffusion-controlled reaction in the present 
apparatus. This is confirmed by the experimental evidence that the etch rate 
is independent of the type of substrate (single crystal or sintered polycrystal). 
The etch rate is found to depend on the equilibrium partial pressure of the 
reaction products. 

(2) The main reaction product is found to be AICI,. From the temperature 
dependence of the etch rate an enthalpy of formation for AlCl, of 
—93 + 3 kcal/mole is found at 1500 K against a tabulated value of 
—80 + 20 kcal/mole. 

(3) The addition of H,O strongly decreases the etch rate for values of Pu,o 
that surpass the py,0 created by the etch reaction itself. 

(4) Chemical transport of Al,0 3 is quite possible in a temperature gradient 
as soon as the volatile aluminium halides can be formed. 
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THE INFLUENCE OF FACET FORMATION 
ON THE LIFE OF INCANDESCENT LAMPS 


by W. LEMS, H. KINKARTZ and W. LECHNER 


Abstract 

The influence of facet formation on the life of an incandescent single- 
coil tungsten filament (130 turns of 36-um wire in 8. 10* N m^ ? argon) 
was studied. At temperatures above 3000 K the coil wire remains round 
throughout its lifetime, below 3000 K facets predominantly of {100} 
type develop. Below 2700 K the life of d.c.-operated coils is determined 
by the formation of notches and thus by surface migration. Facet for- 
mation appears to play a role in this notch formation, as the facet geom- 
etry leads to periodic constrictions in the wire diameter. Above 2700 K 
the life of both a.c.- and d.c.-operated coils is determined by evapora- 
tion. The development of facets decreases temperature and thus increases 
life compared to the situation when with equal initial temperature the 
coil wire would have remained round. In about 70% of all cases the 
site of initially highest temperature correlates with the final highest- 
temperature spot. 


1. Introduction 


A tungsten coil which burns in an inert-gas-filled lamp develops facets, as 
shown in fig. | for a coiled-coil filament. This led to the question of whether 
facet development influences the life of coils either by changing the energy 
balance of the filament or by influencing the mechanical stability. 


Fig. 1. A coiled-coil tungsten filament of a 220-V/200-W inert-gas-filled lamp near the end of 
life. Crystal planes, known as facets, have developed during life. Single crystals, consisting of 
several turns, can easily be identified due to the presence of facets. 
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Under constant-voltage operation, faceting may lead to a temperature 
decrease of the coil if the radiating area, the effective emission coefficient, or 
the electrical resistance of the coil increases. Furthermore local variation of 
these quantities may cause hot spots. Mechanical stability can be influenced if 
faceting leads to constrictions in wire diameter. 

For experiments a straight single-coil filament was chosen, since coiled-coil 
filaments and other coils with nonlinear axes pose difficulties regarding tem- 
perature measurement along the coil. (Accurate temperature measurement is 
of course a prerequisite when studying the life of incandescent lamps.) How- 
ever, the wire diameter, pitch and inert-gas filling of the lamp we chose are not 
too different from those generally found in normal incandescent lamps. The 
results of this investigation will therefore probably be valid for a wider variety 
of lamps. 


2. Experimental details 


Most experiments were performed on a stop/flasher lamp (Philips type . 
12500), which contains a filament of 130 turns of 36-um wire in 8 . 104 N m7? ` 
argon. To permit temperature measurement around the filament, a number of 
experiments were performed on tubular lamps containing identical filaments 
and gas fillings. The lamps were operated either by a.c. or d.c. 

The temperature was measured by both brightness and two-colour pyrom- 
etry, which enabled us to determine temperature differences as small as 2 K +). 
The temperature distribution along the coil was continuously recorded. 
Shortly before burn-out was expected the lamps were switched off. The coils 
were subsequently examined with a scanning electron microscope. The orien- 
tation of crystal planes was determined by means of selected-area electron- 
channelling patterns observed in the scanning electron microscope. 


3. Experimental results 


Coils which had burned at temperatures below 3000 K were observed to 
possess strongly developed facets. Figure 2 shows part of a coil which was 
operated with a.c. at 2780 K. The facets are predominantly {100} planes. When 
facets are present, single-crystal regions can easily be identified. Crystals, whose 
sizes ranged between part of one turn and the total coil length of 130 turns were 
observed. The average crystal length was about 12 turns. The coil axis usually 
lay near (110). ; l 

Figure 3 shows a coil, operated with d.c. at 2750 K, which broke after it was 
taken out of the lamp. At first sight the outsides of d.c.- and a.c.-operated coils 
look rather similar. Looking at the d.c. specimen more closely, however, we see 
notches in the turns that are part of the same crystal. These are all at equivalent 
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positions, corresponding to where the coil broke. Figure 4 shows the fracture 
area in more detail. Before fracture, notching had left only a small fraction of 
the original cross-section. 


Fig. 2. Part of a coil operated with a.c. at 2780 K near the end of life. The coil was cut up to 
allow inspection of the inside (fig. 6). The facets are identified as {100} planes. The coil axis 
lies near (110). 


Fig. 3. A coil operated with d.c. at 2750 K, which broke after it was taken out of the lamp. 
Note the notches at identical positions corresponding to where the coil broke. 


Fig. 4. The fracture area of the coil shown in fig. 3. 
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Figure 5 shows the inside of such a coil. The insides of these coils are seen to be 
lozenge-shaped. Because the facets on the outside and inside are not parallel to 
one another, constrictions arise which apparently develop further into deep 
notches. The inside of a coil operated with a.c. at 2750 K remains much rounder, 
as can be seen in fig. 6. 


Fig. 5. The inside of a coil which was operated with d.c. at 2750 K. Note that the facets on the 
inside and outside are not parallel to one another, which leads to periodic constrictions in the 
coil wire. 


Fig. 6. Compared with the d.c.-operated coil (fig. 5), the inside of an a.c.-operated (2750 K) 
coil remains practically round. 
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Fig. 7. Above 3000 K the wire remains practically round. The example shown is a 3080-K a.c.- 
operated coil. The final spot, in the form of a thin turn, can easily be distinguished. 


At operating temperatures above approximately 3000 K the coil remains 
practically round. An example of a 3080-K a.c.-operated coil is shown in fig. 7. 
The final spot, in the form of a thin turn, can easily be distinguished. 

Whereas hot spots show an accelerating increase of temperature, the tem- 
perature plateau of the more or less undisturbed part of the wire decreases with 
time. Of those filaments showing a more or less flat temperature plateau, the 
decrease of this temperature plateau during life was determined. Above 3000 K 
the decrease was much less than for lower temperatures. At about 3200 K, for 
example, an average decrease of 16 K was found towards the end of life, whereas 
at about 2800 K the decrease was 90 K. 

In about 70% of all cases the final maximum-temperature spot was observed 
to coincide with the site of the initially highest temperature. This initial spot 
of our model lamps showed AT values of about 10 to 50 K. In practically all 
other cases the final spot coincided with a second or third initial spot. Pre- 
liminary results indicate that both variation in wire diameter and coil pitch 
appear to be responsible for initial spots. The temperature distribution and 
spot development was not found to correlate with faceted single-crystal regions. 
Figure 8 shows an Arrhenius plot of the life of coils, t, as a function of the 
initial burning temperature 7, at constant-voltage operation. Above 2700 K 
no difference in life is observed between a.c.- and d.c.-operated coils. The 
slope of the least-mean-square line of the results of the type-12500 lamps 
above 2700 K, yields an activation energy of 238 kcal mol^!. Below 2700 K, 
d.c. operation appears to lead to a considerably shorter life than a.c. The drawn 
line represents an activation energy of 70 kcal mol~! in this region. 


4. Discussion 


The discrepancy between a.c. and d.c. results below 2700 K can be ad- 
equately explained in terms of the well-known d.c. electromigration effect ?:?:*). 
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Fig. 8. An Arrhenius plot of the life of coils 7 vs initial burning temperature To. Experimental 

points: A a.c.- and e d.c.-operated type-12500 lamps, A a.c.- and o d.c.-operated tubular 


lamps. Above 2700 K the results of the type-12500 lamps yield an activation energy of 
238 kcal mol` ! ; below 2700 K the d.c. results yield about 70 kcal mol” +. 


An activation energy of about 70 kcal mol~? fits well in the range of values 
generally attributed to surface migration. Hitherto these effects have mainly 
been observed in vacuum lamps below 2300 K °). The fact that this temperature 
is shifted to about 2700 K in our case can be easily explained by the fact that 
the evaporation rate is lower by a factor of approximately 10? in our gas-filled 
lamp compared with the vacuum case, whereas surface migration is probably 
not affected. When’ the gas pressure is increased this transition temperature, 
below which — under d.c. operation — lamp life is determined by notch forma- 
tion, will therefore be shifted to higher values. As fig. 5 indicates, facet forma- 
tion will in all probability play a role in the development of constrictions that 
lead to notches. 

The activation energy above 2700 K of 238 kcal mol^! is higher than the 
generally accepted value for evaporation of 205 kcal mol^! 6). This might at 
least be partly explained by the fact that the temperature plateau of the faceted 
coils shows a larger temperature decrease than is observed above 3000 K where 
the wire remains round. This implies that with respect to the initial burning 
temperature the faceted coils have burned at a lower effective temperature than 
those above 3000 K. Taking instead of T, an effective burning temperature of 
To minus half the temperature decrease during life (i.e. minus 45 and 8 K at 
2800 and 3200 K respectively) results in a lowering of the slope of the line from 
238 to 214 kcal mol^!. Both this value and the one of 191 kcal mol^! which is 
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found for tubular lamps, appear to be close enough to 205 kcal mol^! to 
identify evaporation as the life-determining transport mechanism in this tem- 
perature range. The larger temperature decrease during life of faceted coils may 
be attributed to an increase in radiating area, effective emissivity, or electrical 
resistance. 

The fact that the temperature distribution and spot development do not 
correlate with the distribution of faceted single crystals can probably be 
explained by the more or less constant crystallographic orientation (110) of 
the coil axis. This implies that the differences in orientation between the single 
crystals is mainly a rotation around the coil axis. The energy balance of these 
faceted single crystals will therefore not differ from one crystal to another. 
Obviously, however, this might be quite different in cases where, due to coil 
geometry, a constant crystallographic coil axis is not possible, as for instance 
in the case of coiled-coil filaments. Facet formation might then lead to spot 
development. 


5. Conclusions 


This investigation was primarily intended. to examine the possible influence 
of facets on life. The formation of facets itself, which occurs in the early stages 
of life, has not yet been investigated. Apart from the influence of facets on life 
the conclusions are limited to the life-determining transport mechanisms and 
the role which the initial highest-temperature spot plays. 

The final highest-temperature spot correlates in about 70% of all cases with 
the initial highest-temperature spot. The causes of initial spots are under in- 
vestigation. No correlation of temperature distribution and spot formation with 
single-crystal-faceted regions was found. As this can be explained by the fact 
that the coil axis has a more or less constant crystallographic direction, the 
situation might very well be different in cases where coil geometry rules out a 
constant crystallographic coil axis. Spot development caused by facet formation 
cannot therefore be ruled out under all circumstances. 

Below approximately 2700 K the life of d.c.-operated coils appears to be 
determined by surface (electro) migration, which leads to notches. Burn- 
through is then determined by mechanical instability. Facet formation seems 
to play a role in this notch formation, as the facet geometry leads to periodic 
constrictions in wire diameter. 

Above 2700 K evaporation is the life-determining transport mechanism. 
Between 2750 and 3000 K facet formation leads to a temperature decrease of 
the coil, and therefore to a longer life than would be obtained if the wire 
remained round. 
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MULTIDETECTOR PM WITH MICROCHANNEL PLATE 


by M. AUDIER and J. P. BOUTOT 


Abstract 


This article presents an analysis of a new type of photomultiplier which 
combines, in the same envelope, a photocathode, a microchannel plate 
and a multianode collector. We have examined in particular the limits of 
linearity of the plate response under pulse operation, and the operation 
of the cross-bar collector (anode-collection efficiency and interelectrode 
cross-talk). The realization of an experimental tube demonstrates the 
validity of the localization system used. However, the lack of uniformity 
in gain of the plate would seem to limit the use of this detector to 
particle counting. 


1. Introduction 


Due to its large size, high current gain and its spatial resolution, the micro- 
channel plate is an especially well adapted component for the design of an 
image-detector tube. 

To be able to discriminate between the amplitudes of the detected signals, 
this multiplier should have a linear response. The first part of this analysis 
therefore tries to determine the linear-response limits of the microchannel plate, 
as a function of both amplitude and pulse rate of the analyzed pulses. 

The multianode collector considered uses the “cross-bar” principle that 
permits a large number of distinct “image elements” to be identified by using 
a relatively restricted number of data circuits. The study of the collection of its 
various electrodes, as a function of their geometries and of the characteristics 
of the incident signal, results in the design of an operational collector. 


2. Linear-response limits of the plate 


Various physical parameters act to limit the performance of a micro- 
channel plate. Some of these are already familiar, such as +): 

— the negative space charge which limits the maximum instantaneous current 
in short pulses; 

— the positive charge progressively acquired by the glass wall which puts a 
maximum on the total charge that can be drawn out of a channel in pulse 
operation; ; 

— the conductivity of the wall material which, on the one hand, determines 
the maximum average current that can be delivered and, on the other hand, 
in the case of pulse operation, requires a recovery time to be taken into 
account. 
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Another parameter can be of importance during the detection of multiple 
electron pulses by a plate in which each microchannel is operating in the 
saturated mode (linearity of the response then being assured by the multiplicity 
of channels participating in the amplification of each pulse). This new limitation 
comes from the effect of a positively charged microchannel on the adjacent 
channels. 

All these limitations, whose dimensions must be known in order to be able 
to optimize the use of an image-analyzer tube, are brought out by the study of 
the plate response when its input is activated by multiple electron pulses. 

The study of the current-pulse transfer function shows the limits due to the 
positive charge acquired by the microchannels. Since one electron is the mini- 
mum input signal per microchannel, this saturation in charge will also limit the 
gain of the plate in multiple-electron-pulse operation. 

The study of frequency response of the plate as a function of the level of the 
pulses detected shows the limits due to the conduction current, the influence of 
the operation of a microchannel on its adjacent channels and the maximum 
recovery time of a microchannel. 


2.1. Maximum output charge per microchannel 


During the propagation of a short pulse inside a microchannel there appears 
an accumulation of positive charges on the wall of the channel, especially near 
the output. This phenomenon causes the electric field to increase slightly in the 
area near the input, but especially to decrease abruptly towards the output until 
the multiplication process can no longer be assured. Thus, for the duration of 
the pulse, the gain of the channel will decrease progressively until the process 
appears to be blocked. The shape of the pulse is therefore modified and the total 
charge carried by the output pulse is limited. . 

If we disregard the space-charge effect along the channel, the electric field E 
remains constant, as does the gain per unit length. The electron current therefore 
increases according to an exponential law in the form: 


I= Io ora, 
x 1 
1= ew (55), 0) 
a 


x being the distance from the channel input, 7; the value of the input current, 
and a the distance travelled by an electron inside the channel of radius r under 
the action of the axial electric field E: 

le Er? 


a=-—E?® ^ 
m Vo 


e Vo being the average energy of the secondary electrons leaving the channel 
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wall and c the average time of flight of these electrons: 


e 1/2 
225 i 
m 


The equation (1) gives the approximate current in the channel at a distance x 
from the input. An annular element of length 4x taken on the channel wall 
should then supply a current 


ln ô x 
AI = Ax To exp (: In 2) 
a a 


During time £, this element charges positively by a quantity AQ = t AI. 
This results, at a distance x from the input, in an average space-charge density of 


AQ 
mr? Ax’ 
In ó x 
—— exp C In 8) 
a a 
By integrating the Poisson equation d?V/dx? = — o/e, we obtain 
dV t x 
— = Io exp DIE 
dx mer? a 


The output charge at distance / is equal to 


l 
0 - th exp(7m8). 
a 


We easily calculate an order of magnitude of the maximum charge which 
corresponds to a cancellation of the electric field at the output of the micro- 
channel considered: 


Omax = Er? E. 


For a plate (used with a gain near its maximum value) consisting of channels 
40 um in diameter, Qmax is of the order of 10714 C per channel or 10° electrons. 

An analysis of the current-pulse-transfer function of such a plate gave a 
maximum-charge measurement of 2-9. 107!* C (i.e. approximately 1-8 . 105 
electrons per channel), its own gain being 1:4 . 10°. 


2.1.1. Current-pulse-transfer characteristic 


The equipment used for these measurements includes (fig. 1): 
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(a) the vacuum chamber in which the multiplier in question is placed, facing the 
electron source; 

(b) the electron source consisting of an electron gun capable of supplying 
almost rectangular pulses of duration between 50 ns and 5 ps, and charges 
varying between a few electrons and around 6 . 10° electrons; the pulse rate 
is set at less than 100 pulses per second so that the recovery time of the 
channels has not to be taken into account; 

(c) a calibration multiplier, placed between the gun and the plate input, 
consisting of a calibrated focussed-mesh multiplier; 

(d) a linear amplifier having a gain of 40 dB and a bandwidth of 100 MHz; 

(e) a 150-MHz oscilloscope. 

The plate examined has the shape of a rectangular panel with the dimensions 
70x 50x 2:4 mm?. It includes channels of 40 um in diameter, which gives them 
an L/D ratio of 60. 

The transfer characteristics obtained (fig. 2) show the changes in maximum 
amplitudes attained at the beginning (/s,) and at the end (Js,) of each output 
pulse of the plate when its input is activated by multiple electron pulses of 
increasing amplitudes. 


Fig. 1. Experimental device. 
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Fig. 2. Current-pulse-transfer characteristic of the microchannel plate. 


The results obtained show that the range of linear amplification is much 
wider in pulsed operation than in continuous operation. The size of this range 
is inversely proportional to the width of the detected pulses. 

With charge saturation of each output pulse being attained, we observe that 
only the peak value of the beginning of the pulse (/5,) changes proportionally 
to the primary signal, with the maximum value at the end of the pulse (/5,) 
tending very rapidly towards a change inversely proportional to the incident 
signal. 

If we assume that the quantity of charges extracted undergoes only a neg- 
ligible change, i.e. that the area of the oscillogram showing its image remains 
virtually constant (verified by a graphic integration), the quantity of charges 


extracted is equal to 
T Js 
Is, I 
N, ( S1 ^S2 2 


where N, is the number of channels participating in the multiplication of the 
detected electrons. 


Q= 


This measurement made with various plate gains and for various types of 
primary pulses shows that the maximum charge which can be extracted from a 


MULTIDETECTOR PM WITH MICROCHANNEL PLATE 231 


microchannel of 40 um (operating simultaneously with a whole group of 
channels) is of the order of 


Osmar © 2.1067 channel 7 


(this saturation charge being proportional to the electric field existing inside the 
channel). 


2.1.2. Frequency response of the plate 


The test circuit used to analyze the response of the wafer in the presence of 
recurrent electronic pulses consists of an amplitude spectrometry system 
including (fig. 1): 

(a) an amplifier ensuring the charge/amplitude conversion and giving a rise- 
time constant of about 1 us to the pulses to be analyzed; 

(b) a 400-channel pulse-height analyzer analyzing the voltage pulses of ampli- 
tude V proportional to the plate output charges; a class of amplitudes V, 
corresponds to each channel of the analyzer in which a number n, of pulses 
(n, < 10°) can be registered; ; 

(c) a frequency meter, triggered by the pulse generator modulating the Wehnelt 
of the electron gun, whose operating range is between 0 and 1 MHz. 


— —^» Electron number Ne- (channel 7!) 


1 10 0 — m —n* — 
—— Frequency (Hz) 


Fig. 3. Changes in the maximum charge which can be extracted in linear operation as a func- 
tion of the pulse rate of the detected pulses. 
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Plotting of the charges Q, delivered by the plate, as a function of the pulse 
rate of the primary pulses, gives the characteristic curve shown in fig. 3. This 
curve shows the maximum charge which can be supplied, in linear operation, 
by a set of multiplier microchannels (this charge being reduced to one micro- 
channel) as a function of the pulse rate of the pulses analyzed. 

` Several remarks can be made: 

— When the quantity of charges inside a pulse approaches 2.10? electrons 
per channel (charge saturation), the maximum pulse rate of the pulses 
analyzed is of the order of 200 pulses per second. This limit represents the 
recovery time of a microchannel operating at total saturation (time required 
for the neutralization of the positive charges on the wall, resulting from the 
multiplication of the detected pulse, by the charges propagated by the con- 
duction current). 

— At this same point of operation, if we compare the quantity of charges 
extracted per pulse (reduced to one microchannel) with the charges supplied 
by the conduction current, we note, for a period, that the ratio existing be- 
tween these two propagations is about 1/10 (ratio usually observed between 
the level of appearance of saturation in continuous operation and the con- 
duction current of the plate). 

— When the charge extracted by each pulse is much less than the maximum 
charge (1/10), the maximum counting rate becomes inversely proportional 
to the extracted charge, therefore directly proportional to the number of 
channels available for the multiplication of each incident electron. In ad- 
dition, the ratio of the charges extracted per pulse or charges supplied by 
the conduction current changes from 1/10 to about 4/10. The change in this 
ratio is attributed to the effect of a positively charged microchannel on the 
operation of its adjacent channels. 


2.1.3. Zone of influence of a microchannel 


This is displayed in fig. 4 in which the abscissa represents the charge prop- 
agated by an output pulse reduced to one microchannel, and the ordinate the 
product of this charge by the maximum corresponding pulse rate in linear 
operation. 

When the plate operates at a voltage of 1300 V (curve (1) in fig. 4), the single 
electron gain (mean intrinsic gain of one microchannel operating separately) is 
about 2-4. 105; for this supply voltage, the maximum charge that a micro- 
channel can deliver, when it is part of a group of channels operating simultane- 
ously, is about 1:95 . 105 e~ /channel. If we assume that this difference is due to 
the interaction of the operation of a microchannel with its neighbours, the 
general shape of curve (1) results from the following various types of operation: 
(a) Zone A: for each pulse, the number of electrons supplied to the plate input 

is much smaller than the number of channels used. The microchannels, 
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Fig. 4. Changes in the extracted charge in linear operation from a microchannel as a function 
of the number of microchannels available for the multiplication of each incident electron. 


working separately, restore 2-4 . 105 secondary electrons per primary elec- 
tron received. The linearity of the frequency response of a channel being 
limited by its own recovery time, the product of the number of primary elec- 
trons in each pulse and the maximum pulse rate of these pulses is constant. 

(b) Zone C: for each incident pulse, the number of electrons accepted by the 
plate is no longer small compared to the number of available channels. The 
effect of the positive charge acquired at the output of each channel starts to 
be seen on the operation of the other channels; this limits the charge 
extracted from each channel. The product of this charge by the pulse fre- 
quency rapidly diminishes toward the point D at which all the channels 
operate simultaneously and, by mutual effect, reduce the multiplication 
factor from 2:4. 105 to 1-95 . 105. 

(c) The position of point B, the transition point between “individual-channel” 
operation and “grouped-channels” operation, allows us to determine the 
minimum number of channels required by each incident electron to be 
multiplied undef optimum conditions; this number is equal to the quotient 
of the intrinsic gain of a channel (2-4. 105) by the value, reduced to one 
channel, of the charge delivered at the output of the plate (2 . 10*), i.e. 

N, — 12 channels per electron (G — 2:4 . 105). 


The same process may be used for the other two operating conditions, giving 
respectively: 
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Vg — 1200 V, G = 14.105, N, = 7 channels/e, 
V, = 1100 V, G = 6:3 . 10+, N, = 2 channels/e. 


The presence of a metallization on the output side of the microchannel 
plate on the one hand and the distribution of the positive charge acquired 
by a channel after operation on the other hand, makes it difficult to find a 
mathematical relationship between the gain of a channel and the extent of 
its zone of influence. Experimentally, this analysis could be carried on with 
respect to higher gains (2 . 105) with plates operating in the saturated con- 
dition (chevron-shaped plates or plates with curved channels). At any rate, 
in the case of high gains, the output metallization, imposing its potential, 
will limit the zone of influence to a few channels. 


3. Cross-bar multianode collector 


The function of this collector is to simultaneously deliver two electric pulses 
per event detected (X, and Yj) permitting the immediate localization and 
determination of the amplitude of an electric signal coming from the micro- 
channel plate amplifier. 

In a first design, this collector consists of two planes parallel to each other.’ 
Flat metal bands, parallel and insulated from one another, are applied to the 
base plane. One plane located a little closer to the electron input (coming 
from the plate output) is constructed with wires stretched out parallel to each 
other, in the direction OY perpendicular to the direction OX of the bands 
(fig. 5). 

Unfortunately, the geometry of this collector is so anisotropic that it leads to 
distortion of the electric field, promoting a flaring of the primary beam and a 
spreading of the secondary emission from the bombarded elements. This distor- 
tion, combined with the inductive and capacitive coupling between the various 
electrodes of the multianode, considerably modifies the anode-collector ef- 
ficiency (> + 10%) and creates an interband interference of more than 25%. 
This reduces the amplitude of the main signal considerably and can cause a 
localization error of more or less one “image element”, which is unacceptable. 

A correct collection efficiency for each element (+ 1%) can however be 
obtained by reducing to a minimum the distortion of the equipotentials at the 
level of the multianode. The use of multilayer-photo-etchifig techniques would 
seem to offer the best results. Unfortunately, the use of this technique does not 
presently seem practical for large-sized components (diam. > 100mm, 
e >25 mm). The interelectrode interference factor can be reduced to a value 
of less than 3% by the use of charge preamplifiers, having a low input 3 
ance (< 100 ohms) on each anode. 
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Signal output 
a) 


b) 


Fig. 5. (a) Principle of a "*cross-bar" multianode; (b) experimental multianode. 


4. Experimental tube 


The design of an experimental tube using the proximity focussing principle 
brings together, in the same envelope, the following elements (ref. 2; fig. 6): 
— a bialkaline photocathode (Sb-K-Cs) deposited on an input window 7 mm 

thick, partitioned to the size of the “image elements" defined by the inter- 
section of a wire and a band of the multianode; this partition, optically 
isolating each element from its neighbours, constitutes an anti-interference 
system; it consists of a metal grating embedded in the window glass; 


236 M. AUDIER AND J. P. BOUTOT 


Channel plate Partition grid Photocathode 
j Multianode 


EE Al bv, 
man 


a) ] 


Fig. 6. (a) Diagram of the experimental tube; (5) input window of the tube partitioned by a 
metal grid; (c) prototype of multidetector tube. 


— a large-sized microchannel plate (50:456: 2-4 mm?) including 40-um 
straight channels (L/D — 60); 

— a cross-bar collector consisting of 5 bands and 4 wires, dividing the photo- 
multiplier into 20 elementary detectors (each detector having a useful area 
of 9x9 mm?). 

The dynamic response of this tube (energy resolution and spatial localization 
capacity) was analyzed by using as a light source a y-ray source of cobalt *) 
associated with a mosaic of 20 Nal (TI) miniscintillators. One image element 
corresponds to each scintillation. With the cobalt source being successively 
placed on each scintillator, the signals registered on each anode element con- 
stituting the systems X and Y are analyzed (amplitude and statistical fluctua- 


*) Co-57, source of 122-keV gamma rays (energy of the medical X-ray field). 
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tions) by an amplitude spectrometry system. This consists of an amplifier having 
a very low input impedance (~ 20 ohms), sensitive to the charge, and a selector 
of 4096 channels (fig. 7). 


4.1. Multianode-collector efficiency 


According to the results obtained (fig. 8), the different anodes constituting 
the collector collect a signal directly proportional to their cross-sections by the 


Multidetectortube 5000.0 


Collimator Pb » (iri d 


y-ray source 


Pulse-shaping 
amplifier 


Scintillator array NaI( [ Current preamplifier 


Fig. 7. Measuring equipment for dynamic operation of the multidetector tube. 
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Fig. 8. Amplitude spectra of the pulses obtained with a 122-keV y-ray Co-57 source placed 
successively on the various image elements defined by a band (Ys) and four “wires” (X1, X», 
Xs, X4). 


238 M. AUDIER AND J. P. BOUTOT 


image element. This permits the possibility of using a single electrode per detec- 
tion cell (wire or band) to classify the amplitude of the various detected signals. 
Since the signal collected by the second electrode is then used only to establish 
the “X-Y coincidence" permitting the spatial localization of the detected signal, 
it may represent only a small part of the analyzed signal (for this experimental 
tube, the wire and band collections were fixed at 53:3 % and 46-7 % respectively). 


4.2. Energy resolution 


It should be recalled that resolution in nuclear spectrometry is given by the 
ratio R = AS/S in which AS is the full width at half maximum of the Gaussian 
curve centred on S (average value of the detected signal) ?). 

Now, the full width at half maximum AS and the standard deviation of the 
signal c = (S? — S2)!/? verifies the relation 


AS = 2:36 c. 
The resolution is therefore 


AS o 
R = — = 236 —, 
S S 


R = 2:36 (Vis)! ^, 


where Vs) is the square of the relative deviation (or relative variance) of the 

signal. 

The signal (S) results from several events in series: 

— conversion of the radiation detected by the scintillator NaI (T1) into photons 
located in the band of sensitivity of the photocathode; the number of 
photons created is N; 

— coupling C between the scintillator and the photocathode; this depends 
mainly upon the optical transmission of the input window; 

— quantum gain of the photocathode: Q,; 

— collection efficiency of the microchannel plate: 7; 

— gain of the electron multiplier: G. 

Let Vy Vo, Vie,» Von and Vigy be the relative variances of these different 
parameters. 

If we assume a binomial law of distribution, the root-mean-square deviation 
defined by the analysis of statistical fluctuations is written: 


o = [Na (1 — a)? 


where N is the number of events, and a the probability of the event. 


The relative variance is therefore written: 
o2 


Vn = > 


(N a)? 
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(in the case of an exponential law of distribution, the relative variance of the 
signal is equal to 1). 

The relative variance of the signal S, obtained at the output of the photo- 
multiplier, is 


“2 " Vey Fen Vey 
NC NCQ, NCQ 


This result shows the importance of the optical coupling (Vro) between the 
scintillator NaI (T1) and the photocathode. In the present case, the optical 
partitioning of the window, which perfectly defines each image element, intro- 
duces a light loss of about 70%. This loss can be greatly reduced by the use 
of a thin window supportrd by a metal grid. This new window has an optical 
transmission of 80%. Its use should permit the spectral resolution obtained 
with 122-keV y-rays (Co-57) to be improved from 60% to about 35%. 

A second improvement can be made by changing the quantum gain (Q,) of 
the photocathode. In our case, this is about 10%. But, at the present time, 
obtaining a quantum efficiency of 20% would present no problem. A quick 
calculation shows that, for such a gain, the spectral resolution obtained would 
be 25%. 

Other improvements can be imagined to improve the results obtained. 
Among these, one of the most interesting to be noted is the use of a plate with 
curved microchannels *:5) whose relative variance Vc, is less than 0-4 (Vig) = 1 
for a plate with straight channels). This new component realized at our labora- 
tory has allowed a new photomultiplier (PM HR 400) to be designed permitting 
a resolution of 45% to be obtained on the single-electron spectrum. With the 
cobalt source used, the resolution would therefore be of the order of 17%. 


Vs, = Va + 


4.3. Uniformity of response of the multidetector tube - 


This depends mainly upon the spatial uniformity in gain of the microchannel 
plate. This uniformity results mainly from the various degassing processes used 
before its use (cycles of baking and degassing by electron bombardment) and is 
still not completely satisfactory. A special study, specifically centred on the 
preparation and degassing of plates, should permit defining the limits of 
uniformity that can be expected with a large-size plate (a uniformity of better 
than 95% can be obtained with a plate which has not been subjected to any 
baking cycle or bombardment). 


5. Conclusion 


As we have just seen, the design of a PM permitting not only the detection of 
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scintillations but also their localization finds a very attractive solution in the 
use of a microchannel electron multiplier. 

Dynamic tests of this component show that, regardless of the pulse rate and 
amplitude of the signal analyzed, the microchannel plate can be used in linear- 
response operation. Limited for each microchannel to an extracted charge on 
the order of 2 . 105 electrons (in the case of 40-um microchannels), this response 
increases linearly with the number of channels used. 

The response of the microchannel plate, as a function of the rate and ampli- 
tude of the multielectron pulses detected, shows that a positively charged micro- 
channel limits the multiplying power of its nearest neighbours. However, the 
use of a group of at least 7 microchannels per electron detected should permit 
multiplication factors much higher than 2 . 10? to be attained ^). 

Coupling, by proximity focussing, of the microchannel-plate amplifier with 
a plane cross-bar collector, permits acquiring locally information in the form 
of electric signals capable of being processed in real time. 

However, the lack of spatial uniformity in gain of the large-sized plate up to 
now limits the possibilities of amplitude selection of this detector. 

The combination of a microchannel plate with a mosaic of insulated anodes 5) 
seems to offer a more flexible solution for obtaining the desired functions 
(detection, amplification, amplitude selection and localization). However, the 
use of such a combination is conceivable only for a detector having only a few 
tens of image elements. 

Among the applications of such a multidetector, we could cite for example 
hodoscopes, Cerenkov counters, gamma cameras, etc. 

Generally speaking, this multidetector could be applied to any system that 
formerly used several conventional photomultipliers. 


Laboratoire d’ Electronique et Limeil-Brévannes, April 1975 
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CALCULATION OF THE TEMPERATURE DEPENDENCE 
OF THE PRIMARY AND SECONDARY PARAMETERS 
OF COAXIAL CABLE 


by R. R. WILSON *) and G. C. GROENENDAAL 


Abstract 


The primary and secondary parameters of coaxial cable have hitherto 
been calculated by means of approximating formulae, and the tempera- 
ture dependence of the parameters has been found by differentiation of 
these formulae with respect to temperature. This method can result in 
great inaccuracy and will therefore not be used here. This article presents 
a derivation of the temperature dependence of the said parameters on 
the basis of a known mathematical model of the coaxial cable. 'The 
formulae thus obtained are applied to a 2:6/9:5-mm coaxial cable. The 
result of the calculation is found to accord well with measurements 
found in literature. 


1. Introduction 


When coaxial cable is used as a transmission medium a good knowledge of its 
electrical properties as a function of frequency and temperature is necessary. If 
the cable losses in a particular application are too high, repeaters will have to be 
inserted in the cable at regular intervals. To enable these repeaters to function 
properly it is most important that attenuation variations in the cable resulting 
from temperature changes should be compensated for. Temperature variations 
in cables have already been investigated in a number of publications +°). The 
temperature coefficient of a quantity A is defined as follows *): 

] 0A 


“Ab? 
where 0 represents the temperature. In the publication referred to above, cal- 
culation of the primary and secondary parameters is always based upon approx- 
imation formulae. If these quantities are differentiated with respect to tempera- 
ture, the resultant inaccuracy can be considerable. 

This article will give a derivation of the temperature coefficients of the 
primary parameters Ap, Ar, Ag and Ac and the secondary parameters A,, Ag 
and Ajz5). 

A 2-6/9:5-mm coaxial cable has been chosen as a numerical example. The 
influence of the two spirally wound steel tapes and the outer domain on the 
various electrical quantities have been ignored. A longitudinal and a transverse 
section through this cable are shown in figs 1 and 2, respectively. 


(0) 


*) NKF Kabel B.V., Delft, The Netherlands. 


242 R. R. WILSON AND G. C. GROENENDAAL 


LLLLELLLIL LILLIA LLL LLL, 
SESS MASS SSSA SSSA 


LZ, M 
SSNSSSSSNSSNNNNSNNNNNNNSNNNNSNNNNSNNS ANN 
VIII B IET Bg áBgág,PgPMAÁISAITTB EE gg IEEE 


Fig. 1. Transverse section Fig. 2. Longitudinal section through 


through coaxial cable. coaxial cable. 


2. The temperature dependence of resistance and inductance 


2.1. The inner conductor 


The following formulae apply to the resistance R,, and inductance L,, of 
the inner conductor ?:*): 


u A; 
Rag = EE Rao Peay | 
2 D; Q) 
u B, 
w Laa Ro--— 
i 
1 
with Rao => , 
z a? o, 
u = a (w uo o.) ^, 
A, = ber u bei’ u — bei u ber’ u, 
B, = ber u ber’ u + bei u bei’ u, 
= (ber' u)? + (bei' u)?. 
. Application of definition (1) yields: 
LM = Ara + Ay + Na; aa Ap» (3) 
Aaa = Argo + Au + Àp, — Ad, (4) 


We can now express all temperature coefficients in terms of A, It follows from 
(2) that 


By logarithmic differentiation with respect to 0 we find: 
Argo = —2 Ay. 


z 


EH 
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We next calculate the temperature coefficients of 4,, B, and D,: 


Corresponding formulae apply for B, and D,. Use of ref. 5, 


ber'u 
ber” u = d + bei «) 


u 


bei’ u 
bei” u = ( — ber n) 


u 


and 


yields after a certain amount of conversion °): 


Di u 
with E, = ber? u + bei? u. 
Substitution in (3) and (4) yields: 


i 4 (= 2 (5) 
= —U A, | ——-— h 
om D, Ay 
2A; Di 
Avan =—ud,{—— z) (6) 


ArjalArag and Ar, /Ag,g are shown as functions of u in figs 3 and 4. It also 
follows from (2) that 


A, = Aa + lA. (7) 


2.2. Outer conductor 


The following formulae apply to the resistance Ry, and inductance La» of the 


outer conductor °): 
. ky, N 
Ry = Re ( : =) 
2x be. 3 Do 


ks No 
oL, Im ( — 1:115 
2% b 04 Do 


244 R. R. WILSON AND G. C. GROENENDAAL 


Fig. 4. The normalized temperature coefficient of Laa versus u. 


with k3? = JO Uo 95, 
No = Io(ks b) K(k; c) + Kolk b) I, (Is c), 
Do = K; (ks b) Ls c) — I, (ke b) Ky (ks c). 
Substitution of 


u, = b (o po o3)? 
and 
Uz = c (W Ho G3)*/? 
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yields u PT—QS 
Ry = Roo — —— > (8) 
(02 $?+4 7? 
u PS+QT | 
w Ly = Ro — ——— > (9) 
2 S?+T? 
with P = —ber u, ker’ u, + bei u, kei’ u, + ker u, ber’ u, — kei u, bei’ u2, 
Q = —ber u, kei’ u, — bei u, ker’ u, + ker u, bei’ 4; + kei u, ber’ uz, 
oP [o 
LANE ML A 
Ou, OU, 
oS S dT T 
peo eis cp 
ou, Ui ou, Un 


In analogy with the inner conductor let us now suppose: 
A; —PT—QS; Bj,2PS--QT; E,—P?--Q?; D= S? - T?. (10) 
Application of definition (1) yields: | 
Án, = Ano + Au, + Aag — Avg» (11) 
Áp,, = Argo + Au, + Ang — Any: . (12) 
In analogy with the inner conductor we now have: 
Arso = —2 Au, 


The temperature coefficients of Ap, By and Do are worked out in the appendix. 
Substitution of (A.11) and (A.12) in (11) and (12), respectively, yields 


24o Eo\ în T?— S? 

dudum ; (13) 
24o Do à, 2ST 

ftg = U1 Au, | ——— zJ- ! (14a) 
Do Bo uy Bo Do 

It follows from (8) that 
l Ay =A, + Ag , 

m dolis (14b) 


Aue =A, +4 fez 


3. The temperature coefficient of the primary parameters 
We now move on to the calculation of the temperature coefficients of the 
primary parameters. The total AC resistance is 


R= Ra + Ry. 
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Logarithmic differentiation in accordance with (1) then yields 
Raa ÅRaa + Ry, Any, 
Ag = ———_———__.. 
R 


In calculating the temperature coefficient of the total inductance we have to 
make allowance for the external inductance 7): 


(15) 


Ho 
Le = — ln (bla 16 
2. (bla) (16) 
or 
iet (17) 
že in (bjd) 
The total inductance now becomes 
L= Laa + Ly, -+ Le 
We then have: 
: Laa Ataa + Lob Arp + Le Îr, 
A ee (18) 
L 
The capacitance of the coaxial cable is rendered by ?) 
VET 
— In(b/a) ` 


Logarithmic differentiation then yields the following formula for the tempera- 
ture coefficient of the capacitance: 


r= =A (19) 
C 7* nba) 
Furthermore: 
i G = o C tan 6’. 


From this it follows: 
Ac m he + Atan ó'. 


It is ‘a known fact that for modern cable-insulation materials the loss angle 
in the temperature range which interests us here (—10 to -|-30 ?C) is practi- 
cally independent of temperature; À tan 6’ is therefore much smaller than Ac 
and we may assume 


le = 1e (20) 


4. The temperature coefficients of the secondary parameters 


In this section the temperature coefficients of the attenuation constant, the 
phase constant and the modulus of the characteristic impedance will be derived. 
In the formulae which follow it has always been assumed that tan? 6’< 1 ^4). 
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4.1. The attenuation constant 


We start from the following formula *): 
LC \12 
a=W (=) [p tan 6’ — 1 + (1 + p?) 172 


ith 2 
wi = —. 
p wL 


Let H = [p tan 6’— 1 + (1+ p?)!?]!2, then logarithmic differentiation 
yields 


da= T Ap tthe baud 
H op 
d 1 p 
with A, =Ag—A, and (t) Q1) 


and after substitution we obtain: 
p [tanó' + p/( + p?)!?] 
K ) (22) 


À, — (4A A iL) MN ERU cM CER CN 
(ac GRE a ee 


4.2. The phase constant 
Here we start from *): 
B 5) p+ tan ó' 
= wo | — ———————————————— . 
2 / [ptanó' — 14+(1 + pyre? 
We then find in the same way as above: 
p 
Ag = —Àa + Ay + Ae + Ag — fr) ———— 23 
B L c + a 1) EEE (23) 
4.3. The characteristic impedance 
We start from the familiar definition 


($m 
Zo = |————_] . 
G 4- jo C 


R? -+ w? I? 1/4 
pas (s Fo? s) 


Then 


which may be rewritten as 


L 1/2 
ES Me 231/4 
zol=(=) a». 


CO? a! OM 
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The temperature coefficient of |Zo| now becomes: 


pA, d(1 + p?)/^ 
Aizol = $ (A, — Ac) + EIE EE TEES 


(1 + p»yll dp 
Substitution of (20) finally yields 
p 
Aizgl = (à + Q.— 2) —) (24) 


With these derived formulae it is possible to calculate the various temperature 
coefficients, assuming that the A which depend on the materials and/or cable 
construction used are known. These are: 


rm Ap; Ae, r hes and Ae 


A computer programme was written which calculates Ag, Ap, Ac, Ags Ag and 
Aizgi- In the next section the theory will be applied to a practical example. 


5. Practical example 


The computer programme was applied to a 2-6/9-5-mm coaxial cable recom- 
mended by the CCITT. This cable is of the disc insulation type (see fig. 2) with 
an inhomogeneous dielectric, so that some complications are involved in the 
calculation of A,, 4, and À. 

The discs are assumed to have a thickness d,, their spacing to be s and the 
dielectric constant of the (polythene) disc to be z,. It then follows that the 
relative dielectric constant of the inhomogeneous dielectric 9) is 


d, 
& = 14 (6 —1) —. (25) 
s 


Logarithmic differentiation of (25) yields 


dcs £p (4s, + Aap) = Aa, f Q6) 
£y — l4 s/d, 

It has been assumed here that the centre distance of the discs will not change 
with temperature (A, = 0). 

Two steel tapes are wound round the outer conductor, which consists of 
copper foil folded in the longitudinal direction. The main difficulty is in cal- 
culating 4,,. The discs expand in the axial direction not only as a result of the 
coefficient of linear expansion but additionally owing to the fact that expansion 
in a radial direction is prevented by the steel tapes. , 

In a calculation of 4, and 24, performed by Videc 5) the steel tapes are 
replaced by a solid steel tube having the same dimensions. The result of that 
calculation is as follows: 


TEMPERATURE DEPENDENCE OF PARAMETERS OF COAXIAL CABLE 249 


E, d, eV? 
imag +S Pfi nthi) a+] 
E, s c 
,, 216) + (A v2 0 — a/e?) aa — 9) tfe? + 1 + vila 
(e?e? — 1) (1 — a]c?) (1 — n2?) 


? 


Q7) 


29; a a? 
n-anama taea] — e» 


where o, is the coefficient of linear expansion, E, the elastic modulus and 
Ym the Poisson number in medium m (see fig. 1). In the temperature range rele- 
vant to the cable, Em, &m and v, are independent of temperature. If we start 
from the following nominal data: 


2a — 2:62 mm, 2c — 10-0 mm, d,— 2mm, 
2b = 9-52 mm, 2e — 10-5 mm, s = 30 mm, 


then we get the values as given in table I. 


TABLE I 


Em (N/mm?) 


Gm (T7 1) 


Substitution in (26) and (27) yields 
Ae = &4 + 1-17 . 107? (0:1373 o, + 1:2108 æ> — 1:3481 o4), 


hay = 0:0732 a, -+ 1:8572 «2 — 0-9204 a. Q9) 
These formulae show that in view of the values of «,,: 
A, = 6 
T (30) 
Aa p = 1-86 Q5. 


It will be seen from fig. 1 that (¢ is the thickness of the copper foil), 


c=b+t. 
Then 


b t 
À = = Ay += dy. 
c [4 
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We assume that ¢t « b and also that the change of thickness of the copper 
foil due to temperature will be extremely small. 
We can then say that 


Ay = — he (31) 


Computer programme 


The following input data are necessary for the computer programme (ambient 
temperature 10 ?C): 


2a = 2:62 mm, l = 1000 m, 

2b = 9:52 mm, A, = —1:943 . 1075, 

2c = 100 mm, he, = Åe = —412.107?, 
0, = 03 = 60. 10° (S/m), Aq = 16-60 . 1076, 

tan ó' = 0-3. 1074, Ay = 12:83 . 1076, 

& = 1:085, Ae = 12:21 . 1076. 


The results of the computer calculations are shown in table II. 4,, A, and 
Aizo are plotted as functions of the frequency in figs 5, 6 and 7. The measured 
results found in the literature are also pictured. 


6. Conclusion 


On the basis of a mathematical model of a coaxial cable known from literature 
the temperature coefficients of the primary and secondary parameters have been 
worked out. | 


———— 
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Fig. 5. The temperature coefficient of the attenuation in a 2:6/9-5-mm coaxial cable versus 
the frequency (at 10 °C). 
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Fig. 6. The temperature coefficient of the phase constant.of a 2- 6/9- 5-mm coaxial cable 


versus the frequency (at 10 °C). 
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Fig. 7. The temperature coefficient of the modulus of the characteristic impedance of a 
2:6/9-5-mm coaxial cable versus the frequency (at 10°C). 


The derived formulae can be used as basic formulae for the calculation of the 
temperature coefficient of any type of coaxial cable. They are here applied to a 
2-6/0-5-mm cable as recommended by CCITT. The results were found to be in 
good agreement with measurements known from literature *). m 


Appendix 
On the basis of definition (1) we find: 
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We now pass on to the calculation of the partial derivatives occurring in (A.1). 
For this purpose we introduce: 


U = beru, keiu, + beiu, ker u, — ker u, beiu, — keiu, ber uz, 


V = beru, keru, — beiu, keiu, — keru, beru, + keiu, bei uz, 


W = ker' u, bei u, + kei’ u, ber u, — ber’ u, kei u, — bei’ u, ker uz, Us) 
Z = ker’ u, ber u, — kei’ u, bei u, — ber’ u, ker uz + bei’ u, kei uz. 
The following relations can now be verified: 
oP P oV 
— ——— + U, P =—-—, 
Ou, us duz 
d oU 
nt 
u u u 
2 2 2 (A.3) 
oS S dZ dQ 
—=——+ PF, S=-—, =—-—, 
Ou, D Qus Ou, 
oT T oW oV 
cup Tut. E 
OU us OU Ou, 


On the basis of definition (10) and applying the above relations we find the 
following for the partial derivatives with respect to u,: 


0Ao 1 E, 

Ou, =. u, B Ao j 

9B, ps E, (A4) 
Ou, uU; Bo 

9D, l Ao 

Qu; E (- u, i 2) 


For the partial derivatives with respect to u, we find: 
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I pou ce (A.5) 
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These formulae can be simplified still further. For this we start from the relations 
occurring in ref. 7. Let 


Po = I(x) Ko(y) — Joy) Ko) , 
qo = Ix) Ko’) — fO) Ko(x) , 
ro = lo'(x) Ko(y) — O) Ko'(x), 
So = Io (x) Ko'(v) — Io'Q) Ko'C9, 


with x = u, exp (2/4) and y = u, exp (jx/4), j being the imaginary unit. 


(A.6) 


We now have ?) 
1 


(A.7) 


Po So — do l'o = — 
JU, U2 


It is now possible to express po to so in terms of P to Z (see (10) and (A.3)). 
We find: 
Po =V+jU 
qo exp (jx/4) = —P —j Q, 


ro exp (jn/4) = —Z—j W, in 
JSo —8S-jT. 
Substitution in (A.8) and equating real and imaginary parts yield 
1 
VS—UT—PZ+QW= g 
uuz 
(A.9) 
US+VT—-QZ—PW=0. 
Substitution of (A.9) in (A.5) yields 
1 dko 2 1 luu +2(UT—Q W) 
Ag du, i uz EP , 
(A.10) 
1 9B, 2 E W-—VT) 
Bo Ou, i uz Bo i 
Substitution of (A.4) and (A.10) in (A.1) gives 
es, a (- mi 2) zc amc A 
ET = Uy Ag, | — San irm , * 
m M Tes uy Do Ao Uu; Ao Do 
r r 4 ( 2A4o zx) à, 2S (A.12) 
—Ap, = Uy Ay ——|-— $ 
x Po uiis uy Bo Bo uy BoDo 
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0-10 
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400-00 
500-00 
600-00 
800-00 
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1200-00 
2000-00 
3000-00 
4000-00 


4-095 


—0-254 
—0-253 
—0-247 
—0-236 
—0-220 
—0-199 
—0-175 
—0-112 
—0-032 
0-066 
0-627 
1:677 
3-050 
4-666 
6:440 
10:113 
13:511 
16-290 
20-715 
19-127 
16:871 
15-258 
14-117 
12:557 
11-501 
10-742 
9:116 
8-113 
7:333 
6:652 
6:073 
5:202 
4:608 
4-181 
3-193 
2:563 


TABLE II 


—1-651 
—1-651 
—1-651 
—1:651 
—]1:651 
—1:651 
—1:651 
—1:651 
—1:651 
—1:651 
—]:651 
—1-651 
—1-651 
—1:651 
—1:651 
—1-651 
—1:651 
—1:651 
—1:651 
—1:651 
—1:651 
—1:651 
—1:651 
—1:651 
—1-651 
—1-651 
—1:651 
—1:651 
—1-651 
—1:651 
—1:651 
—1-651 
—1:651 
—1:651 
—1-651 
—1:651 


2114 
2:129 
2:188 
2:262 
2:334 


| 2:405 


2.474 
2-607 
2-730 
2-843 
3-188 
3-413 
3-487 
3-474 
3-414 
3-239 
3-059 
2-913 
2-672 
2:639 
2:597 
2:539 
2:482 
2:385 
2-305 
2-235 
2:046 
1:971 
1:982 
2:011 
2:034 
2-054 
2:055 
2-050 
2:042 
2:040 
2-039 


Ar At he A, As 
. 107° . 1075 . 1075 . 107? . 107? . 


1-965 
1-950 
1-890 
1-816 
1-742 
1-670 
1-599 
1-462 
1-333 
1-213 
0-827 
0-528 
0-359 
0-263 
0-206 
0-153 
0-134 
0-128 
0-121 
0-103 
0-087 
0-076 
0-069 
0-059 
0-053 
0-048 
0-039 
0-033 
0-029 
0-026 
0-023 
0-018 
0-015 


0-013 


0-008 
0-005 
0-003 
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TABLE II (continued) 


Îr Ay he 
7 i |} .1075 .1075 f|. 


5000-00 

6000-00 0:017 

8000-00 0-016 
10000-00 0-015 
12000-00 0-014 
20000-00 0-012 
30000-00 0-011 
40000-00 0-011 
50000-00 0-010 
60000-00 0-010 


80000-00 
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NOVEL SEMICONDUCTOR SWITCH WITH 
CONTROLLABLE DELAY AND RAPID TURN-ON 


by M. V. WHELAN and L. A. DAVERVELD 


Abstract 


The construction and principle of operation of a semiconductor switch 
made in silicon and results of its behaviour are presented. The device 
has a controllable delay (us-minutes) before switching rapidly from its 
out (10+! O) to its on (300 Q) condition. The delay can be varied over a 
wide range simply by varying the forward current through a p-n junc- 
tion. An external capacitor determines the range of time delays. 


This communication is concerned with a switch we have made. The device 
.has a controllable delay before switching rapidly to its on condition. It is this 

latter characteristic in particular which distinguishes our device from one which 
came to our attention while preparing our manuscript +). The delay in our 
device can be varied over a wide range simply by varying the forward current 
through a p-n junction *). An external capacitor is used to change the range 
of time delays. 

Before considering the actual device we made, let us consider first the opera- 
tion of a simpler type of switch. A top view and a cross-section of this are shown 
in fig. 1; only half of the device is shown since it is symmetrical about the 
line ġġ’. The n* region 1 is completely surrounded by the n+ region 2 which 
in turn is surrounded by the p* region 6. The poles of the switch are the con- 
tacts to the two n* regions 1 and 2. 

The foregoing device operates as follows. A negative voltage is applied to the 
MOS gate 3 and the p substrate 5, with respect to the relatively thin epitaxial 
n-type layer 4. This results in a depletion region in the silicon under the MOS 
gate 3 and also between the p substrate 5 and the n epi-layer 4. Because the p* 
buried layer 8 is more heavily doped than the p substrate 5, this latter depletion 
region will extend further into the n epi-layer near the buried layer. Furthermore 
the depletion region which extends downwards into the silicon under the elec- 
trode 3, can be quite extensive for the following reason. The reverse bias between 
the p region 5 and the n epi-layer 4, will prevent the formation of an inversion 
layer of holes at the SiO,—Si interface. If the voltage between 3, 5 and 4 is 
sufficiently large, the n epi-layer under 3 and above the buried layer 8, will 
become completely depleted, as indicated schematically in fig. 1a. This deple- 
tion layer isolates the inner nt region 1 from the outer nt region 2. It is essential 
that this occurs without depleting completely the n layer between the p* region 6 
and the p substate. The reason for this will become apparent. When the switch S 
is opened, the recovery of conduction between the two n* regions is delayed 


*) Incident light can also be used instead of the p-n junction. 
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Fig. 1. A simplified type of switch. The device is symmetrical about the centre line $4". 
(a) Cross-section view, (b) top view. 


because the capacitor C will first have to be discharged. The rate of discharge 
can be regulated by varying a forward bias applied between the p* region 6 
and the n epi-layer: this bias causes holes to be injected across the n epi-layer 
into the p substrate, these cancel the negative charge on the capacitor plate 
connected to 5. The p+ region 6, the n epi-layer 4 and the p substrate 5 form 
the emitter, base, and collector, respectively, of a transistor: the base contact 
is region 2. 

Figure 2a shows schematically the behaviour of conduction between the two 
n* regions vs time, with the forward voltage on region 6 as parameter. These 
characteristics are not ideal. Characteristics of the type sketched in fig. 2b would 
be more desirable. These indicate no conduction for a period, which can be 
varied, and then a rapid transition from the out to the on condition. We will 
now describe a device we made and which exhibits these ideal characteristics. 

A cross-section of our actual switch is shown in fig. 3. It is very similar to the 
foregoing device except for the addition of the p* region 7, the groove *) 10 
and the p* buried layer 9 under the groove. All three completely surround the 
other regions. The switch operates as follows. A sufficiently large negative voltage 


*) In a more recent modification, instead of a groove, a gate electrode is placed on the oxide, 
and is connected to electrode 3. . 
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Fig. 2. (a) Qualitative indication of switching characteristics for simplified device; c,,2 is 
the conduction between regions 1 and 2. (6) Qualitative indication of desirable switching 


characteristics. 


Fig. 3. Cross-section of our actual switch. 


is applied to regions 3 and 5 with respect to the region 2, so that the inner 
n* region is isolated from the outer n* region by a depletion region. Now the 
p* region 7, the n layer 4, and the p substrate 5 represent the emitter, base, 
and collector, respectively, of a second transistor. The base contact is region 2. 
The base current of this transistor will have to flow along the n layer and 
between the groove 10 and the underlying p* buried layer 9. If the voltage ap- 
plied to the capacitor C is sufficiently large, then the n region between the 
p* buried layer 9 and the groove 10 will be completely depleted. This occurs 
before the n layer between regions 6 and 7 and the p substrate becomes de- 
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pleted. This is so, for reasons already explained in connection with the buried 
layer 8. The foregoing situation is indicated schematically in fig. 3. Due to the 
depletion region between the groove 10 and the buried layer 9, the second 
transistor composed of regions 7, 4 and 5, and which has its base contact at 
region 2, will receive no base current. The emitter 7 is isolated from the base 
contact 2 *). When the switch S is opened, the capacitor C will be initially 
discharged by holes injected across to region 5 by the p* region 6. When the 
voltage on C however is reduced to a certain value (no measurable conduction 
yet between the two n* regions 1 and 2), the second transistor 7, 4, 5 which 
has a large forward bias between regions 2 and 7 will begin to get base current 
due to the appearance of a thin neutral n region between the groove 10 and 
the p* buried layer 9. Holes now injected from region 7 across the n epi-layer 4 
to the substrate 5, begin to discharge C. This in turn leads to a further decrease 
of the depletion between the groove 10 and region 9, which in turn leads to a 
further increase of the base current and a further increase of charge injection 
by 7. A type of snowball effects occurs. Once this begins, because of the relative- 
ly large forward bias between 2 and 7, the capacitor C will be rapidly dis- 
charged. Thus, to summarize, the voltage on region 6 determines the delay 
before the rapid switching-on occurs. Region 7 is responsible for the rapid 
turn-on at the end of the delay. The value of tbe capacitor C can be varied to 
change the range of delays. The poles of the switch are regions 1 and 2. 

Before considering some measurements on the foregoing device some techno- 

logical information for a completed device is given: 

— oxide thickness under the gate electrode 3 is 2000 A; 

— epitaxial n layer is 1-9 um thick, doping level 3. 10!5/cm?; 

— substrate: p type (1005, 10-40 O cm; 

— buried p layer: sheet resistivity 280 O/[]; 

— regions p*: 6 and 7, 0-6 um deep; after deposition sheet resistivity is 
100 Q/[: areas of regions 6 and 7 were 4-4 . 107^ cm? and 74 . 107^ cm?; 

— distance between the top of the buried layer and the top of the n epi-layer 
1 um; 

— depth of groove 0-7 um; this was defined in some samples by chemical 
etching; in others by oxidation with silicon nitride as a mask. 

The resistance between the two n* regions 1 and 2 (fig. 3) for the on con- 
dition of the switch is 300 Q. The conduction between these regions was reduced 
by 10° when the switch was in its off condition. This was attained for a voltage 
of 4 volts between regions 3 and 5 with respect to 4 (fig. 3). The base current 
of the transistor (regions 7, 4, 5) which is responsible for the rapid transition 
from off to on condition was reduced by 10° when the switch was in its off 
condition. This was attained for a voltage of 4-75 volts between the regions 3 


*) A transistor structure in which the base current could be reduced to zero by increasing 
the collector voltage has also been considered elsewhere ?:?). 
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and 5 with respect to 4. Thus when the voltage on capacitor C (fig. 3) reaches 
4:75 volts the transistor 7, 4, 5 becomes active. Measurements have shown that 
the transistor is fully active when the voltage on C reaches 4 volts and while 
the conduction between the poles (1 and 2) of the switch is still a minimum. 
"The foregoing thus means a rapid transition from out to on condition for the 
switch. i : 

The variation of conduction between the n* regions 1 and 2 (fig. 3) versus 
time, with voltages on the p* regions 6 and 7 as parameters, is shown in fig, 4. 
The capacitor C was charged to —5-4 volts. As can be seen the desired switching 
behaviour is obtained in figs 4a and b. The influence of lowering the voltage 
to the p* region 7 is shown in fig. 4c. The time scales for figs 4b and c are 
alike. 


V =450mV 


-3 
3610s V= 400mV 


c) ? —Time 


Fig. 4. Measured switching characteristics for the device of fig. 3; (6) and (c) indicate the 
influence of V;. The time scales are alike for these two cases. 


Figure 5 shows measured and calculated values of the delay time v versus 
the voltage on the p* region 6 (fig. 3). This region is used to vary v. This c 
is exponentially dependent on the voltage applied to the p* region. This de- 
pendence is expected since the current injected across the n layer into the 
substrate is the collector current of the transistor formed by regions 6, 4 and 5. 
The ranges of v shown in fig. 5 were changed by altering the value of the 
capacitor C. The change was proportional to the change in C. 
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Fig. 5. Measured and calculated behaviour of the delay time versus the voltage Vg for the 
device of fig. 3 (T — 298 K: initial voltage on C was —5:4 V). 
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MINORITY-CARRIER INJECTION AND TRANSIENT 
RESPONSE OF A MOS CAPACITOR 


by M. V. WHELAN 


Abstract 


'The influence of minority carriers injected by a p- junction across a 
quasi-neutral n layer on the transient response of a MOS capacitor is 
measured and explained quantitatively. 


1. Introduction 


Our object here is to illustrate and explain quantitatively the influence of 
minority-carrier injection from a p-n junction across a quasi-neutral n layer 
on the transient response of a MOS capacitor. 


2. Measurements 


The device used is illustrated in the inset in fig. 2. The approximately 4 um 
thick n layer (~ 10!5/cm?) was epitaxially grown. The inner and outer diam- 
eters of the gate electrode are 150 and 750 um. The thermally grown oxide 
was 0:25 um thick. 

A bias is applied to the gate electrode with respect to the n+ region and is 
sufficiently large to cause considerable inversion of the underlying silicon sur- 
face. When the gate voltage is suddenly changed and held at a larger negative 
value, the depletion region under the gate becomes larger than its thermal- 
equilibrium value '*?). As minority carriers are generated in the space-charge 
region or injected into it, the depletion region relaxes to a thermal-equilibrium 
width which corresponds to the new gate voltage. Here we are interested in the 
influence of injection of minority carriers on the speed of relaxation of the 
depletion region. The injection is caused by a forward bias applied between 
the n layer and the p* substrate. | 

The relaxation of the surface space-charge region is monitored by measuring 
the capacitance (30 kHz) between the gate and the n* contact. The out voltage 
step is sufficiently small *) to ensure that the increase in the width of the sur- 
face depletion region, immediately after the change of gate voltage is small 
compared to the width of the n layer. 


3. Analysis of measurements 
After application of the voltage step to the gate we assume that the hole 


*) The change in the width of the surface space-charge region immediately after changing 
the gate voltage is 0-2 um. 
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concentration at the edge of the surface space-charge region is reduced to a 
* value small compared to the thermal equilibrium value p,o. The hole concen- 
tration at the edge of the depletion region between the n layer and the p* region 
is Pao exp (f Vyr). If L is the distance between the edges of the two foregoing 
space-charge regions and if we neglect the change of L with the forward bias 
Vr then the rate of diffusion of holes into the surface space-charge region is 
J, Dy Pno exp (B Vir) 
—=— ; (1) 
q L 
D, is the hole diffusion constant; B = q/kT; q, k, T represent electron charge, 
Boltzmann constant and temperature, respectively. 

Before and after the application of the voltage step the thermal-equilibrium 
width of the surface space-charge region is practically the same ?) so that the 
number of minority carriers supplied to the inversion layer to bring the region 
into thermal equilibrium after the voltage step AV is 


AV 
= Cors (2) 
q 


where C,, is the oxide capacitance per cm?. Equations (1) and (2) yield the 
time ¢ needed to supply P by diffusion across the n layer: 


AV 
t= Cox L exp (6 Vyr). (3) 
q D p Pno 
Figure 1 shows a typical curve of the relaxation of the surface space-charge 
capacitance to its equilibrium value after the application of a voltage step. 
With such figures 7 was measured versus V;p. Figure 2 shows a plot of 
log ¢ versus B Vyp. For values of V; > 120 mV, t is indeed proportional to 


Gate voltage changed 


WESS SENIEEBER 
zd h- Hin Surface space 
| | | LA- charge again 


"m paf in equilibrium 
output p=: Ji 


Fig. 1. Example of transient response of a MOS capacitor. Shown is the output signal of the 
capacitance bridge, which was balanced before voltage change on the capacitor. 
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Fig. 2. Plot of log t versus $ Vyp. The recovery time of the MOS capacitor is denoted by t; 
the forward voltage causing injection of minority carriers into the surface space-charge region 
is denoted by Vyp. 


exp (— Vyp) as predicted by the very simple model above. The departure 
from the straight line for Vyp lower than 120 mV, we attribute to thermal 
generation of minority carriers inside the surface space-charge region becoming 
important. 
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SILICON-SURFACE SPACE-CHARGE 
STRONG-INVERSION CAPACITANCE AND 
MINORITY-CARRIER INJECTION 


by M. V. WHELAN 


Abstract 


The capacitance of a MOS diode is measured under the condition of 
strong inversion at the silicon surface: in particular we study the 
influence on this capacitance of minority-carrier injection by a forward- 
biased p-n junction, across a quasi-neutral n layer. 


1. Introduction 


The object of this communication is to illustrate and explain the measured 
influence of minority-carrier injection from a p-n junction on the silicon-surface 
space-charge capacitance for the condition of strong inversion at the silicon 
surface. Grove and Fitzgerald +) studied the influence of injection on the sur- 
face potential at which inversion begins at a silicon surface. Their structure 
did not lend itself to study the behaviour of the silicon-surface space-charge 
capacitance after the formation of an inversion layer. 


2. Measurements and interpretation 


Our structure also considered earlier ?) is similar to one used by Rosier ?). 
It is illustrated in the inset (a) in fig. 1. The capacitance between the n* region 
and the gate electrode (inner and outer diameters 150 and 750 um), was 
measured (100 kHz) for different forward voltages Vp applied between the 
n* and p* regions: the resulting C-V curves are shown in fig. 1. 

We consider the capacitance C, as a function of the voltage V;, for gate 
voltages V, (< —15) corresponding to strong inversion at the silicon surface. 
We consider the capacitance C, (see inset fig. 1) to consist of two lumped 
capacitances in series, AC,, determined by the oxide thickness and the area A 
of the gate electrode, the other due to some equivalent series capacitance C,. 
The values of C, obtained from the curves of fig. 1 for V, < —15 volts, are 
tabulated versus V;, in table I. Also tabulated are the calculated values of 
silicon-surface space-charge capacitance A Cp = A &g Za 1 (area A of the gate 
electrode is 0-425 . 107? cm?; A C,, = 65-3 pF; equilibrium electron and hole 
concentrations in the about 3 um thick n epi-layer are n,o = 7:2.10!5 cm7? 
and p,o = 2:7 . 10* cm~? at 300 K; & is the dielectric constant of the silicon; 
B = g[kT'; q, k, T denote electron charge, Boltzmann's constant and absolute 
temperature, respectively). The width z, is 
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Fig. 1. Small a.c. signal (100 kHz) equivalent series capacitance measured between the V, 
contact and the n* region (see inset (a)) versus the d.c. gate voltage V, with the forward 
diode voltage Vyp as parameter. 


-4 
z= É EN 2l . (1) 
qno Ê Pno €Xp (B Vir) 
It is based on the depletion approximation *) for the surface space-charge layer 
and the following assumptions: (i) the injection of holes in the n layer is small 
compared to 75,6, (ii) the n layer is sufficiently thin compared to the hole diffu- 
sion length so that the concentration of holes at the edge of the surface space- 


charge layer is Pao exp (B Vyr), (iii) z, becomes independent of surface poten- 
tial once strong inversion exists at the silicon surface. 


TABLE I 
Calculated values of A Cp and the measured values of C, versus Vj, 


[ [o [as [2 [9 [55 [9 


A C, (measured) (pF) 133 | 142 | 156 184 | 394 | 4500 


A Cp (calculated) (pF) 214 
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For values of V;, up to 300 mV the change in the silicon-surface space-charge 
inversion capacitance can be attributed to a change of the width of the surface 
space-charge layer. For larger values the measured capacitance increases more 
rapidly with V;& than can be attributed to a change of z}. We consider the 
possibility of this being due to the minority carriers (holes) in the inversion 
beginning to follow the a.c. measuring signal. 

If the frequency response of minority carriers in a strong inversion layer is 
determined by the rate of diffusion of minority carriers from the quasi-neutral 
bulk at the edge of the surface space-charge layer an equivalent lumped circuit 
which governs this situation has been suggested *) and is shown in inset (b) 
in fig. 1, together with some equivalent series forms of the same circuit. The 
small signal resistance Rp/A is associated with the diffusion of holes from the 
quasi-neutral bulk to the space-charge layer. In a manner analogous to that 
used in ref. 4 it can be shown that 


Rp L 
A AqD,B Pro exp (B Vy) 


D, is the hole diffusion constant, L denotes the thickness of the quasi-neutral 
n layer (3 pm). This expression differs from that of ref. 4, to the extent that it 
includes the influence of V; and the thinness of the n layer compared to the 
diffusion length of holes in this layer. Upon referring to inset (b) in fig. 1 it can 
be shown that 


(2) 


A (40 C) (= y G) 
—— == w = € ; 
A NG 
w is 2x times the measuring frequency. 

We now consider for values of V;; the measured values of C, when they no 
longer equal A Cp. We assume the disagreement is due to Rp/A; by assuming 
that A Cp could be adequately described by A &s;/z,, where z, is given by 
eq. (D), and using the measured values of C, and eq. (3) we obtained values 
for Rp/A. Using eq. (2) we also calculated values for Rp/A. The two sets of 
values are tabulated versus V;, in table II. 


TABLE II 
Rp/A versus Vs, extracted from the measured values of C, and also calculated 
using eq. (2) 


Rp[A (measurements) 


Rp/A (calculated eq. (2)) 


3-2. 10* 
4-8 . 104 


7-9 . 103 | 17.10? 


6-9 . 10? 1-0. 10% 
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The relatively good agreement between the calculated and measured values 
of Rp/A in table II indicate that at higher values of V;. the increase in the 
silicon-surface space-charge capacitance can be attributed to the increase in 
the frequency response of the minority carriers in the inversion layer, and that 
the silicon-surface space-charge capacitance can be adequately described by 
the equivalent circuit in inset (b) of fig. 1. 
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BaFCI:Eu?^*, A NEW PHOSPHOR FOR X-RAY- 
INTENSIFYING SCREENS 


by A. L. N. STEVELS and F. PINGAULT *) 


Abstract 


A number of phosphors for X-ray-intensifying screens have been 
evaluated by calculating figures of merit. On use in combination with 
standard (“blue”-sensitive) X-ray film, BaFCl:Eu?* and BaFBr:Eu?* > 
give better performance than the traditional CaWO4 and more ' 
recently developed UV- or blue-emitting materials (e.g. sulphates and 
y-oxysulphides). The calculated figures of merit of BaFCl:Eu?* or 
BaFBr:Eu?* /standard-film combinations are comparable to those of 
Gd4O5S:Tb/green-sensitive X-ray film systems. The preparation of 
optimal fluorohalide:Eu? * phosphors involves proper formation of the 
host lattice, complete reduction of Eu?* ions as well as elimination of 
afterglow. Measurements on powders and experimental screens indicate 
that, by using BaFCl:Eu?* screens in radiography, important dose 
reductions can be achieved without the necessity of using other than 
standard (“blue”-sensitive) X-ray films. 


1. Introduction 


In the past several attempts have been made to replace CaWO,, as a material 
for X-ray-intensifying screens. Among these phosphors, ZnS:Ag, BaSi,O,:Pb 
and BaSO,:Pb had some emphemeral success. In the long run, however, their 
advantages did not surpass the drawbacks in such a way that these materials 
rendered CaWO, obsolete, even for special applications. 

More recently a large number of new materials have been found, which, in 
theory, can be used in radiography. These phosphors include: 

— alkali-earth sulphates and phosphates: (Ba, Sr)SO, :Eu?*, Ba3(PO,), :Eu?* ; 
— cesium iodide doped with Na or TI; 

— alkali-earth fluorohalides (BaFCl:Eu?*, BaFBr:Eu?*); 

— rare-earth oxysulphides (Gd,O;S:Tb, Y,O;S:Tb). 

Some of these materials have been discussed more or less extensively in litera- 
ture 176), but no systematic evaluation of the phosphors has been effectuated. 
On the one hand, practical problems such as the preparation of optimal mate- 
rials and the measurement of absorption as a function of X-ray energy make 
this a cumbersome task. On the other hand, such a large number of properties 
have to be taken into account that it is difficult to define a figure of merit 
covering them all. 


*) Massiot-Philips, 80690 Ailly-Le-Haut-Clocher, France. 
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In the present paper we define a figure of merit (which is in fact an X-ray 
quantum-detection efficiency of phosphor-screen-film combinations) calculated 
from X-ray absorption and luminescence and detection-efficiency data. X-ray 
noise and the contrast rendition of screens are not included in this figure but 
it will be found that the classification of the phosphors by quantum-detection 
efficiencies is not changed by taking these into account. 

For phosphors to be used in intensifying screens we find that BaFCI:Eu?* 
and BaFBr:Eu?* have high figures of merit. These fluorohalides have superior 
properties compared with other UV- or blue-emitting phosphors, like CaWO,, 
(Ba, Sr)SO,:Eu?* or Y,O,S:Tb, and this prompted further investigation at 
our laboratories. CsI:Na has a figure of merit which is even better than for 
BaFX:Eu?* (X = Cl, Br) but this material is not suitable for practical ap- 
plication in intensifying screens since its luminescence is adversely affected by 
moisture in the air. The present paper discusses the problems of preparing the 
fluorohalides. In addition, a number of measurements are presented which 
substantiate the good performance of BaFCl:Eu?+ and BaFBr:Eu?* predicted 
by the calculations. 


2. Calculation of X-ray quantum-detection efficiences 


2.1. Phosphors to be evaluated 


In the present calculations we consider the following phosphors: 

(1) BaFBr:Eu?* and BaFCl:Eu?*. Their luminescence upon UV and C.R. 
excitation has been described recently by Sommerdijk et al.?). 

(2) (Ba, Sr)SO, :Eu?* 9). On partial replacement of Ba by Sr optimum X-ray- 
luminescence properties are found. These are better than for the related 
compound Ba,(PO,), :Eu?* °), so that the latter will not beconsidered further. 

(3) CaWO,, the traditional phosphor for intensifying screens. 

(4) CsI:Na. Evaporated layers of this phosphor have replaced (Zn, Cd)S:Ag 
powder screens in a new generation of image-intensifier tubes (e.g. refs 1 
and 2). 

(5) CsI:Tl. An alternative proposed for the blue-emitting CsI:Na. The CsI:TI 

' phosphor has a broad (“white”) luminescence band. 

(6) Gd,O;S:Tb. This phosphor has been proposed recently by Buchanan et al. 

`~ (see e.g. ref. 6) for intensifying screens. In contradistinction to the other 
phosphors in the list, Gd,O;S:Tb is mainly green- (and even red-) emitting. 
To make full use of properties of this phosphor, the use of other than 
standard (UV- and blue-sensitive) X-ray film is necessary. 

(7) Y,0,S:Tb. A material related to Gd,O,S:Tb 5). Its lower X-ray absorp- 
tion than e.g. that of the Gd compound is compensated by its high radiant 
efficiency and, on use of standard X-ray film, its favourable blue emission 
colour. 


-— 
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2.2. The figure of merit 


For a given X-ray energy the figure of merit of a phosphor screen is defined 
as follows: 


F.M.g = Ag 7) gm Cz Dy, 


where Ag = the X-ray absorption of the screen; 
n =the luminescence radiant efficiency under cathode-ray excitation 
of the phosphor considered; 
Aga = the mean wavelength of the emission detected by the X-ray film; 
Cg = a correction factor for energy losses in the screen; 
D, = the efficiency of detection of the screen emission (J,) by a film 
with response R;; D, is defined as J IR, dA I, dà. 


The X-ray absorption coefficients at X-ray energies in the 20-100-keV range 
were calculated from the tables published by Storm and Israél!9). Both 
coefficients Lrot.en and fMiot,ars Were determined. The former allows for the 
escape of all secondary radiation, while the latter does not. In actual practice 
part of the secondary X-rays generated will be re-absorbed so that the true 
absorption will lie between the values calculated from the two coefficients 
Hrot,en ANd fiot,avs- Figures for the re-absorption were calculated with the aid 
of a method proposed by Oosterkamp ++). X-ray-absorption data will be 
presented below for 200 um thick screens. The packing density is taken as 
50% for powder screens and equal to 100% for evaporated screens of CsI:Na 
and CsI:Tl. It can be shown that the data obtained for 200-um screens are 
significant in the range of thicknesses between 50 and 400 uum; on comparison 
of the various phosphors the relative error with respect to the results for 200-um 
screens will not exceed 20% and is less than 10% in most cases. The latter de- 
viations are small compared to the differences in the actual figures of merit. 
The luminescence radiant efficiencies under C.R. excitation were measured at 
20 kV (see table I). In general these data also apply to X-ray excitation (see 
e.g. refs 4 and 12) if corrections are made for light lost in the screen (these are 
included in the factor Cg). The factor gm converts energy efficiencies into quan- 
tum efficiencies and is derived from the emission spectra given in figs 1, 2, 3 
and 4. The factor Cz includes energy losses by scattering which cause a difference 
between energy absorption and attenuation of the X-ray beam, losses in the 
lacquer (powder screen only) and losses due to photo-electrons leaving the 
screen. A further correction considers the self-absorption of the fluorescent 
light. If we take C equal to 1 for a screen without losses we find, dependent 
on the material and the X-ray energy considered, C = 0:7-0-8 for vapour- 
deposited screens and C = 0-5-0-7 for “fine-grained” powder screens. For 
thicknesses of 200 um the figures for C for “front-screen” application (cf. 
ref. 13) is about equal to that for “back-screen” application. The data for D; 
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TABLE I 
Cathode-ray efficiencies and D, data of X-ray phosphors 


20-kV C.R. 
radiant efficiencies (97) 


detection efficiency 


D, C9 


phosphor 


BaFCl:Eu?* 13 96 
BaFBr:Eu?* 96 
BaSO, :Eu2* 94 
CaWO, 81 
CsI:Na 85 
CsI:Tl 21 
Gd,O,S:Tb 13 


Y,0,8:Tb 


given in table I were calculated from the emission data presented in figs 1-4 
and the response of a standard (UV- and blue-sensitive) X-ray film and a 
green-sensitive X-ray film (in the case of Gd,O;S:Tb and CsI:TI). 


2.3. Results 


The results of our calculations are presented in figs 5-8, which show the 
calculated figures of merit plotted versus X-ray energy. In order to simplify 
a direct comparison at a particular X-ray energy, a relative scale has been 
introduced by taking the figure of merit of CaWO, equal to 1 independent 
of the X-ray energy. In figs 5-7 standard X-ray film has been used as detector 
of the screen emission; in fig. 8 figures are calculated for the use of green-sen- 
sitive films. 

Figure 5 indicates that the performance of BaFCI:Eu?* and BaFBr:Eu?* 
screens is superior to that of CaWO, screens between 20 and 100 keV. Between 
37 and 69 kV, that is over almost the entire diagnostic range, F.M.,, is 3:1 
and 4-7 (37 keV), respectively, and increases to even 5:3 and 7-4, respectively, 
at 69 keV. The main parameter contributing to these large differences is the 
high radiant efficiency of the fluorohalides: BaFCI is almost 4 times better than 
CaWO, (13 vs 3-57) in this respect and BaFBr better still (16 vs 3-597). The 
increase is only partly due to a better X-ray absorption. This accounts for a 
factor up to 1:2 and 1:3, respectively, near 69 keV. Above that energy the X-ray 
absorption of CaWO, is 1-5-2-0 times better than that of the fluorohalides but 
this is more than compensated by the better radiant efficiency of the latter. In 
this energy range F.M.,,, of BaFCI:Eu?* is 2-5-3. For BaFBr:Eu?* we find 
3-4-5. |: 

It can be seen in fig. 6 that the performance of CsI:Na screens is also superior 
to that of CaWO, screens, F.M.,,, being 4-16 between 20 and 100 keV. This is 
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Fig. 1. Emission spectra of BaFCl:Eu?* and BaFBr:Eu?* (coinciding, -——: ), BaSO4:Eu?* 
(...) and CaWO, (- — . The spectral response of a standard X-ray film is represented by 
a drawn curve. 
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— A(nm) 
Fig. 2. Emission spectra of CsI:Na (- - -), CsI:TI (7: ~.~) and CsBr:TI (. . .). The spectral 
response of a standard X-ray film is represented by a drawn curve. 


— A(nm) . 
Fig. 3. Emission spectrum of Gd;O;S:Tb. The spectral response of a standard X-ray film 
is represented by the dashed curve 1, that of a green-sensitive film by curve 2. 
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Fig. 4. Emission spectrum of Y4;O5S:Tb (low Tb content). The spectral response of a stand- 
ard X-ray film is represented by the dashed curve 1, that of a green-sensitive film by curve 2. 
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Fig. 5. Relative figure of merit vs X-ray energy for 200-um screens of BaFCl:Eu?*, 
BaFBr:Eu?* and CaWO,. The screens are in contact with standard X-ray film. 
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F.Mret 
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——» X-ray energy (keV) 


Fig. 6. See fig. 5. The drawn curve represent figures of merit for Csi:Na, for CsI:Tl and 
for CaWO,. - 


mainly due to the better luminescent-energy efficiency of CsI:Na and to the 
fact that this phosphor can be evaporated. Vapour-deposited screens have ap- 
proximately twice the packing density of powder screens and therefore a higher 
X-ray absorption and lower X-ray quantum noise. Moreover, the factor C 
(see above) in these screens is higher. A further important advantage of using 
evaporated screens would be that technologies can be employed that result in 
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the suppression of lateral light leakage and thus in improvement of the resolu- 
tion of the screen ?). 

The drawback that renders CsI:Na unsuitable for use in X-ray-intensifying 
screens is the hygroscopicity of the material. The decrease of the luminescence 
in air is so fast that even if protective layers were employed, screens with an 
acceptable lifetime could not be produced *). This is why, although the figure 
of merit of CsI:Na is 1-5-3 times higher than that of BaFCl:Eu?*, we have 
not considered this phosphor further for use in intensifying screens. 

CsI:Tl is much less hygroscopic, but the outperformance of CaWO, is much 
less and in fact restricted to the 36-69-keV range. If we compare it with the 
fluorohalides no advantage exists at all. The unfavourable emission colour of 
CsI:T1 compared with the standard X-ray film is responsible for this (cf. fig. 2). 
In contradistinction to the iodide, CsBr:Tl has mainly a near-UV emission !*). 
The properties of this Tl-doped bromide are in some aspects more similar to 
those of CsI:Na. One of these, the hygroscopic character of the material, seems 
to be prohibitive for practical application. 

In fig. 7 we have brought together data on Y,O,S:Tb, Gd,O,S:Tb, 
BaSO,:Eu?* and CaWO,. In this group Y,0,S:Tb tends to be the most 
promising material with a figure of merit 3-5 times better than CaWO, below 
69 keV. Compared to the fluorohalides Y ,O;S:Tb performs slightly better 
below 37 keV, but its behaviour above that energy is inferior. In the diagnostic 
range, BaFCl:Eu?* outperforms the oxysulphide 1:0 resp. 1:6 x at 40 keV, 
1-4 resp. 20x at 60 keV and 1:8 resp. 2-5 X at 80 keV. 

We note that in Y,O;S:Tb a high luminescence radiant efficiency has to 


BaS0, :Eu 
Ca WO, 


0 20 40 60 80 100 120 
— X-ray energy (keV) 


Fig. 7. See fig. 5. The drawn curves represent Y202S:Tb (low Tb content), the dashed curve 
Gd505S:Tb, the dash-dotted curve BaSO4:Eu?*. CaWO, is represented by a drawn line. 


*) On using CsI:Na in vacuum tubes (e.g. as phosphor in input screens of image-intensifier 
tubes) the hygroscopy problem does not exist. CsI:Na is than clearly superior: This has 
been analysed in some detail in e.g. ref. 2. R 
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compensate for a relatively low X-ray absorption. The latter also gives rise 
to a relatively high X-ray quantum noise. Above 50 keV the related Gd,O,S:Tb 
has a much better X-ray absorption, but the response of the standard X-ray 
film to its mainly green emission is very low. The calculated F.M.,,; are better 
than CaWO, in the 50-69-keV range only. Outside this area the performance 
is not so good. 

For BaSO,:Eu?* the F.M.,, curve has a similar form as for BaFCl:Eu?* 
and BaFBr:Eu?*. The figures of merit are lower by a factor of more than 2 
resp. more than 3, however. 

In fig. 8 we give the calculated figures of merit of some phosphor screens 
used in combination with green-sensitive films. In order to compare with figs 
5-7, we use for CaWO,, in fig. 8, F.M. data derived for standard film. For 
simplicity it is assumed that the overall sensibility of the green film is equal to 
that of the standard film (it may be doubted, however, whether in practice such 
sensibilities can be reached). The performance of CsI:TI with green film is as 
good as that of CsI:Na used in combination with standard X-ray film. Since 
CsI:Ti is much less hygroscopic than the Na-doped phosphor it seems to be 
a very promising material. Our first tests with vapour-deposited CsI:Tl screens 
show indeed results as good as suggested by fig. 8. However, it was also found 
that after X-ray exposure of CsI:Tl screens a strong afterglow occurred, which 
is unacceptable in radiological practice. 

The calculated performance of Gd,O;S:Tb screens (used in combination with 
green-sensitive X-ray film) is less than that of CsI:TI ones, but the application 
problems seem to be less. In fact such screens are being marketed now all over 
the world. On comparison of Gd,0,8:Tb (fig. 8) with the BaFX:Eu?* phos- 


) 20 40 60 80 100 120 
——» X-ray energy (keV) 


Fig. 8. See fig. 5. Data are given for Gd,O5S:Tb and CsI:Tl in contact with a green-sensitive 
X-ray film, and for CaWO, in contact with a standard X-ray film. 
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phors (fig. 5) we see that in the 20-37-keV range BaFBr:Eu^?* is slightly better, 
whereas Gd,O,S:Tb is in turn slightly better than BaFCI:Eu?*. In the 37-50- 
keV range, the fluorohalide-Eu?* phosphors outperform Gd,O,S:Tb; between 
50 and 69 keV Gd,O,S:Tb is clearly superior. Above 69 keV BaFBr:Eu?* can 
compete with the oxysulphide; the figure of merit of BaFCl:Eu?* is less. 

In practice, where polychromatic X-rays are used, a general conclusion may 
be that BaFBr:Eu?* and BaFCl:Eu?* have a performance comparable to 
Gd,O,S:Tb, and have the important advantage over the last-named material 
that no special films have to be used. 


3. Preparation of BaFCl:Eu?* and BaFBr:Eu?* phosphors 


3.1. General considerations 


In the first instance samples of BaFCI:Eu?* and BaFBr:Eu?* were prepared 
by milling dry BaF,, BaCl, c.q. BaBr, and EuX; (X = F, Cl, Br) and heating 
the mixture at 700-850 °C in a slightly reducing atmosphere consisting of N, 
and H,. As a result a large variety of white to bluish grey products were ob- 
tained. Some samples consisted of well crystallized coarse grains, others of 
irregularly intergrown small crystallites only. The measured light outputs ranged 
from practically zero to about the maximum values predicted by our calcula- 
tions. 

All samples prepared in the way described above were found to have an 
unacceptable afterglow after X-ray irradiation (see sec. 3.3). For BaFCI:Eu?* 
the afterglow was 10-30 times as intense as for CaWO,, while for BaFBr:Eu?* 
this factor was up to 100 x. Moreover, after milling in order to get an appropriate 
grain size, these high afterglow levels had increased further. 

In view of the foregoing the problem of preparing fluorohalides suitable for 
use in X-ray-intensifying screens is twofold, namely: 

(1) on the one hand, to find the proper conditions for the formation of the 

host lattice, the reduction of Eu?* and incorporation of Eu?* ions; 

(2) on the other hand, to avoid the formation of afterglow centres. 

As will be seen below, these objectives do not conflict but are to a large extent 
parallel. It is now possible to prepare samples with light output close to the 
calculated (maximum) values. In the case of BaFCl:Eu?* the afterglow has 
been reduced to the CaWO, level. For BaFBr:Eu?* a considerable reduction 
has been achieved but at present the afterglow is at least four times that of the 
tungstate. From an application point of view, therefore, BaFCl:Eu?* is at 
present the most interesting member of the BaFX:Eu?* family. 


3.2. Preparation experiments 


Firstly we observed that even small traces of oxygen in the starting materials 
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or in the gas flow during heating were fatal for both the light output and the 
afterglow of the sample. In fact, an almost linear correlation was found between 
afterglow level and oxygen content. On doping with Eu it was found that 
fluorohalides activated with Eu in the form of EuF, did give products inferior 
to samples where Eu was added as EuCl, or EuBr;. This is ascribed to the 
high melting points of EuF4 and EuF, (namely 1390 and 1380 °C) in relation 
to EuCl;/EuCl, (850 °C/727 °C) and EuBr;/EuBr, (702 °C/677 °C). Analysing 
in more detail the reduction of trivalent Eu and its subsequent incorporation 
into the host lattice, led us to conclude that these processes require high reaction 
temperatures and a strongly reducing atmosphere. However, the proper forma- 
tion of the host lattice was found to be badly affected. 

It was found that BaCl, and more especially BaBr, were much more unstable 
under these conditions than BaFCI and BaFBr, respectively. This led to the 
conclusion that the formation of the host lattice and the reduction/incorpora- 
tion of the Eu ions has to be achieved by two separate steps, the first at tem- 
peratures as low as possible, and the second at temperatures as high as possible. 

In the low-temperature process BaFCl and BaFBr were obtained by evaporat- 
ing to dryness a suspension of BaF, to which BaCl, .2H,O and BaBr, .2H,O 
respectively had been added. X-ray diffraction showed that the product ob- 
tained in this way was free of BaF, and BaCl, c.q. BaBr,. After addition of 
EuCl, (EuBr3) the samples were fired at 850—900 °C in an atmosphere con- 
taining e.g. 1% of H,. The coarse-grained phosphor obtained in this way has 
to be milled to obtain the grain size needed for making intensifying screens. 
This had the effect of increasing the afterglow, but the latter appears to be of 
a different nature from the afterglow discussed above. In our opinion it is 
related to crystallite perfection rather than to chemical defects, since it can be 
restored again to its original level by prolonged heating at e.g. 700 °C in N,. 

Under the preparation conditions described above (high temperature, strongly 
reducing atmosphere) the formation of HCl and HBr seems to be favoured and 
halide vacancies may therefore exist in BaFCI. By doping experiments we have 
found that these chlorine vacancies — if present — do not increase the after- 
glow. For example, addition of small amounts of KCl in BaFCI:Eu?* resulted 
rather in a lower than in a higher afterglow. On the other hand the presence 
of metal vacancies seems to favour the formation of afterglow centres. Addition 
of e.g. LaCl, to BaFCI:Eu?* resulted in greatly increased afterglow. A similar 
effect was also observed on heating BaFCl in HCI gas, diluted with nitrogen. 


4. Measurements 


4.1. Light output of powder samples 


The light outputs on X-ray excitation were measured by comparing powder 
layers of about 500 um thickness (1-4 g/cm?, packing density is 60 27). The results 
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TABLE II 


Measured and calculated light-output data 


measured light 
output 


adjusted light 
output 


calculated light 
output 


phosphor 


CaWO, 100 100 

BaFCI:Eu?* 417 390-450 *) 

(“fine”-grained) 

BaFCl:Eu?* 474 420-480 *) 
*coarse"-grained) 

BaFBr:Eu?* 386 640-680 


*) The effect of grain size on the light output can only be estimated in an inaccurate way. 


are given in table II. The first column represents measured values of the quan- 
tum output, while the second refers to data adjusted for comparison with the 
calculated figures in fig. 1. With a tube voltage of 70 kV and a 20-mm filter the 
effective radiation distribution is peaking at 50-55 keV. For BaFCI :Eu?* there 
is reasonable agreement between the measured data and the values calculated 
' (4-4 times better than CaWO,) for these energies. For BaFBr:Eu?* the agree- 
ment is less satisfactory (the light output was calculated to be 6:5 times higher). 
In fact the measured light outputs are even lower than for BaFCl:Eu?*. At 
present we have no explanation for this phenomenon, since the effectiveness of 
BaFBr:Eu?* on film is quite good (see below). 

The effectiveness in contact with standard film was measured by comparing 
the X-ray doses required to obtain a unit density. For this purpose powder 
phosphors were placed in a cuvette measuring 10 x10 X0-35 mm?. The film in 
direct contact with the cuvette is placed at the X-ray-tube side. With a tube 
voltage of 90 kV and a 3-5-cm AI filter the effectiveness of BaFCI:Eu?* was 
found to be 5-7 times that of CaWO,, while values up to 10 X were measured 
for BaFBr:Eu?*. 


4.2. Afterglow of powder samples 


The afterglow measurements consisted in integrating part of the decay of the 
luminescence as a function of time. A film was placed in contact with the cuvette 
described above, one minute after irradiation of the sample, and left in that 
position for five minutes. The afterglow figure is expressed by the ratio of the 
film density obtained to that of a standard CaWO, screen measured in the same 
way. 

For 30 Róntgen X-ray radiation from a tube at 90 kV (no filtering) we found 
for our best BaFCI:Eu?* an afterglow figure of 1. For BaFCI:Eu?* obtained by 
direct synthesis at high temperatures (cf. sec. 3.1) the afterglow figure was 4-6. 
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In the latter case the afterglow is so bright that it can even be measured with a 
photomultiplier. The afterglows of CaWO, and well prepared BaFC1:Eu?* are 
close to the photomultiplier noise, so that proper comparison with these stand- 
ards was not possible in this way. It was, however, estimated that the after- 
glow of directly synthesized BaFCI:Eu?* is more than ten times that of CaWO,. 
Photomultiplier measurements of the afterglow of BaFBr:Eu (prepared in a 
similar way) gave a ratio up to 100. For BaFBr:Eu?* obtained with our im- 
proved preparative method, this value is reduced to about 4. 


4.3. Measurements on experimental screens 


In figs 9 and 10 we compare new BaFCl:Eu?* screens (4:4 g/dm? « 200 um 
thickness), Philips Universal screens (3:7 g/dm? ~ 120 um thickness), Philips 
Ultra-S screens (8-9 g/dm? ~ 290 um thickness), and commercially obtained 
Y ,O4S:Tb screens of about 350 um thickness. These screens were evaluated 
using a UV- and blue-sensitive standard film. For measurements on a com- 
mercially obtained Gd,O,S:Tb screen (about 190 tum thickness) a green-sen- 
sitive film (cf. figs 3 and 4) was used. For the screen-film combinations tle 
relative speed was defined as the ratio of X-ray doses required to obtain unit 
film densities for each radiation quality considered. The latter was characterized 
by the thickness HVT of Al which reduces the X-ray beam to half its original 
energy. It can be concluded from fig. 9 that the screen consisting of the new 
BaFCl:Eu?* phosphor is superior to the blue-emitting phosphors over the 
whole energy range. At very low HVT the y-oxysulphide is still comparable 
but its relative speed is much lower in the actual medical range. Both CaWO, 
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Fig. 9. Relative speed of X-ray-intensifying screens for various X-ray-energy distributions. 
The latter are characterized by HVT (mm Al), the thickness of Al reducing the X-ray beam 
to half its original energy. The curves include a BaFCl:Eu?* screen, a Gd2O5S:Tb screen, 
a Y202S:Tb screen, and Philips Ultra-S and Philips Universal CaWO, screens. For screen 
thicknesses, see text. i 
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screens, the relatively fast (and thick!) ultra-S and the Universal screens, lay 
still further behind so far as speed is concerned. On comparison of the 
‘BaFCl:Eu?* /standard-film combination with the Gd,O,S:Tb/green-film com- 
bination, we see that the BaFCI:Eu?* combination is better at the lowest X-ray 
energies (HVT =~ 4). At somewhat higher HVT (between 5 and 7) Gd,O,S:Tb 
is slightly better. Above HVT — 8 mm we have approximately equal per- 
formances. 

In fig. 10 we have plotted the contrast rendition as a function of the spatial 
frequencies for the screens discussed in fig. 9. The pseudo-MTF curves were 
obtained by measuring the square-wave response of the screens with the help 
of a Funk test grid consisting of lead strips at varying distances apart. The 
X-ray radiation originated from a tube at 60 kV and was filtered by 3-5 cm 
of Al. The Philips Universal screens had the best resolution. Owing to the fact 
that the screens are thicker and the grains in it are coarser, the pseudo-MTF 
of the Gd,O,S:Tb, BaFCI:Eu?* and Ultra-S CaWO, types is below that of 
the Universal screen. The same conclusions appear to hold even more defi- 
nitely for the Y,O,S:Tb screen. 

The data in figs 9 and 10 can be combined by defining a relative quality factor 
F — S R? where S is the speed and R the resolution measured by the methods 
described above. The excellent performance of the new BaFCl:Eu?* is illus- 
trated by the fact that F relative to Universal screens ranges for 3 up to 7. 
With respect to an Ultra-S screen, F for BaFCl:Eu?* screens is 3-3-5. Because 
of poor resolution the F number of Y,0,S:Tb ranges from 1-5 to even below 1, 
compared with both types of CaWO, screens. For the Gd,O,S:Tb combination 
we find F with respect to the Universal screens in the range 2:5 to 6 and on 
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Fig. 10. Relative contrast vs spatial frequencies for the screens in fig. 9. 
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comparison with Ultra-S screens between 2 and 3. In both cases BaFCI:Eu?* 
is equal or even better than Gd,O,S:Tb. 


5. Conclusion 


Compared to traditional phosphors such as CaWO, and to more-recently 
developed UV- or blue-emitting phosphors, BaFCI:Eu?* and BaFBr:Eu?* are 
superior for use in X-ray-intensifying screens. If also green- or white-emitting 
phosphors are taken into account, we find that the fluorohalide-Eu?* screens 
are at least comparable with e.g. Gd,O,S:Tb screens, but have the important 
advantage that no special films have to be used. 

These conclusions are supported both by calculated figures of merit and by 
measurements on powders and experimental screens. By using appropriate pre- 
parative methods the original drawback of afterglow after irradiation has been 
eliminated for BaFCI:Eu?*. The high speed of BaFCI:Eu?* screens make us 
believe that with them dose reductions can be achieved in full-size radiography. 
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EUTECTIC-LIKE AND PERITECTIC-LIKE REACTIONS 
IN TERNARY SYSTEMS AT A CONSTANT PRESSURE 


PART I. SYSTEMS CONTAINING ONE LIQUID 
AND TWO SOLID PHASES 


by J. van den BOOMGAARD 


Abstract 


The equilibria between the liquid phase and two solid phases in very 
simple ternary systems were considered in a previous paper, in which 
an auxiliary curve, the effective ternary solidus curve, was introduced 
as a tool for the description of a steady-state solidification process. The 
paper, however, was restricted to very simple ternary systems. In the 
present paper the treatment is of a more general character and includes 
peritectic-like reactions and equilibria as well as transitions of eutectic- 
like equilibria into peritectic-like equilibria. Once again the concept of 
the effective ternary solidus curve was found to be a convenient tool. 


1. Introduction 


In a previous paper *) we introduced the effective ternary solidus curve (ETS) 
and its projection on the composition triangle (PETS) in ternary diagrams at 
constant pressure. 

The PETS has been defined as the locus of the intersection points Q, of the 
tangents to the projection of the ternary liquidus curve, the PTL, in a com- 
position P, and the projections of the tie lines CD,, between the compositions 
of both solid phases in equilibrium with a liquid of composition P, (fig. 1). 
Each Q, should be the average composition of the two solid phases growing 
from that liquid if an infinitesimal amount of the liquid has solidified. 

For the simple ternary systems dealt with in ref. 1 and in the first two sections 
of this paper, this definition does not lead to difficulties because one of the solid 
phases is a pure component while the other consists of solid solutions of the 
two other components. However, expansion to ternary systems in which both 
phases may contain the three components requires a reconsideration of this 
definition. The mean composition of a two-phase solid growing under equi- 
librium conditions from a melt on the TL must be situated on the tie line 
between the individual compositions of the two solid phases or coincide with 
one of these compositions, but it can never be situated on the extension of such 
a tie line. Consequently the PETS changes its character if the projection of the 
effective tie line, i.e. the tangent at a certain point of the PTL in the direction 
in which the TL moves to higher temperatures, is not situated between the 
projections of the two tie lines connecting the composition of this liquid with 
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the compositions of the two solid phases in equilibrium with it. Under such 
circumstances the liquid solidifies with a peritectic-like reaction, as will be shown 
in the following sections. 

To demonstrate the usefulness of the PETS, we start (sec. 2) with the de- 
scription of some solidification processes in the simple ternary systems treated 
in ref. 1. After that the restrictions concerning solid solubility are dropped. 
Ternary systems with two completely separated phases are considered in sec. 3. 
In sec. 4 some cases of incompletely separated phases are considered. 


2. Solidification processes in a eutectic trough in simplified ternary systems 


We will treat some solidification processes at constant pressure in a liquid 
whose composition is situated in a eutectic trough in a very simple ternary 
diagram (figs 1 and 2). 


2.1. Solidification under equilibrium conditions 


Solidification under equilibrium conditions means that at each moment 

during the solidification process both solid phases and the liquid phase are 
homogeneous and in equilibrium with each other. 
i, The composition triangle of a ternary system is given in fig. 1. E, is the 
eutectic composition in the 4-C system and E; the eutectic composition in the 
B-C system. It is also assumed that the temperature decreases continuously 
along the ternary liquidus curve (TL) of which E,P,E, is the projection (PTL) 
in the direction from the eutectic in the A-C binary system to the eutectic in 
the B-C binary system (Tz, > Tz,). 


i 
Dj Dj, Dj D; Ds à 
Fig. 1. Composition triangle with PTL (E,P,E;) and PETS (E, Q,E») projections of the 
path of the liquid (P,P... Ps) and of the solid (Q,0'2Q'5P,) during solidification under 
equilibrium conditions. 
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If an amount of liquid whose composition is given by P, on the PTL is 
solidified reversibly, the average composition of the first infinitesimally small 
amount of the two-phase solid, grown from that liquid, is given by Q, which 
is situated on the PETS; Q, is the point of intersection of the tangent to the 
PTL in P,, drawn in the direction in which the TL moves to higher tempera- 
tures, and CD, the projection of the tie line between the two solid phases in 
equilibrium with the liquid of the composition P,. 

On further solidification both solid phases and the liquid phase must remain 
in equilibrium with each other and each phase must remain homogeneous, 
because we assumed equilibrium during the whole solidification process. As 
the liquid changes continually in composition during this solidification, the 
average composition of the solid phases must also change. 

To discover how this average composition changes we examine the state of 
equilibrium when the liquid has reached the (homogeneous) composition given 
by P, on the PTL. As we are dealing with a closed system the average com- 
position of the three phases (2 solids + 1 liquid) must be given by P,, and 
therefore the average composition of the two solid phases together must be 
situated on the straight line P,P,. It must also be situated on the projection 
of the tie line of the solid phases in equilibrium with the liquid of composi- 
tion P,. The average composition of the two solid phases is therefore given by 
the point of intersection of the two lines, Q’,. It is therefore possible to de- 
termine such a composition Q’, for each composition P, assumed by the liquid 
during solidification, and consequently the whole solidification path can be 
plotted. Because the composition of the liquid during such a process moves 
along the PTL, the composition of the last infinitesimal drop can be found by 
drawing the line CD; through P, which cuts the PETS at Q, and the tangent 
through Q; to the PTL in the direction in which the TL moves to lower tem- 
peratures. The tangent point P, gives this composition. Thus the average solid- 
ification path of the two solid phases as they grow from the liquid starts on 
the PETS and ends on the PTL. This holds for every starting composition of 
the liquid situated on the PTL. 

Such equilibrium solidification processes, however, will hardly ever be en- 
countered in practice. 


2.2. Solidification under semi-equilibrium conditions 


We consider a solidification process L—- S, + S2 in a similar ternary 
system, assuming that there is always equilibrium in the liquid and also at the 
solid-liquid interface but that the solid phases, once grown from the liquid, do 
not change either in composition or in amount; i.e. the liquid remains homo- 
geneous during the solidification process but the solid phases do not. 

If a liquid of composition P, (fig. 2) is solidified under such conditions the 
first infinitesimal amount of the two-phase solid has the average composition, 
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B 


Fig. 2. Composition triangle with PTL (£,P,£2) part of which is the path of the liquid 
(P,P... E2) during solidification under semi-equilibrium conditions, and PETS (E; Q, £2). 


given by Q,, and the ratio in which the two solid phases occur is given by 
Q,C/Q,D, where D, and C represent the individual compositions of the two 
phases. The next infinitesimal amount of the solid has another composition, 
also situated on the PETS (E, Q,£,). During the solidification process the com- 
position of the liquid moves along the TL towards lower temperatures and thus 
along the PTL in the direction of E;. At P2, the last infinitesimal amount of 
solidified material has the average composition given by Q2, while the first 
amount still has the composition given by Q,. The result of such a solidification 
process (e.g. a Czochralski process in which a constant-diameter bar of two- 
phase material is pulled from the liquid in a crucible while the liquid is kept 
homogeneous by means of ultrasonic agitation) is a bar in which the average 
composition changes with the distance along the bar in the direction of growth. 
Jf, after the experiment is ended, very thin slices perpendicular to the growth 
direction are cut from the bar the average composition of each slice will be 
situated on the PETS, the first solidified slice having the composition given 
by Q,. If in the situation in fig. 2 solidification is stopped at the moment the 
liquid has reached the composition P3, the last solidified slice will have the 
composition given by Q3. The average composition X of the total bar solid- 
ified at that moment must be situated on the line P4P, and between R,, the 
point of intersection of this line with the chord Q,Q;, and R,, the point of 
intersection of this line with the PETS, as will be readily seen (fig. 2). The exact 
position of X between R, and R, depends on the shape of the PTL and the 
PETS. The PETS also plays an important role in this idealized solidification 
process. 
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2.3. Steady-state solidification 
For the role of the PETS in steady-state solidification see ref. 1. 


3. General treatment of the equilibrium L *5 Sa + Sg in a ternary system with 
two separate solid phases and one liquid phase 


If the solid solubility of C in 4-B and that of A and B in C are taken into 
account, other possibilities for the equilibria between the liquid and the two 
solid phases are created. Figure 3 represents a projection of the three-phase strip 
on the composition triangle, for such a case. It consists of three curves: the PTL 
(£,PE,), the projection of a ternary solidus curve, PTS; (G,G4G.), which is the 
curve of compositions of the f phase in equilibrium with the liquid phase and 
the « phase, and the projection of a ternary solidus curve, PTS, (D,D3D,), giving 
the compositions of the « phase in equilibrium with the liquid phase and the 
B phase. In this figure the projection of the effective ternary solidus curve is also 
constructed in the usual manner (E, QE;). The temperature is assumed to de- 
crease again along the TL from the eutectic E,, between 4 and C, to the 
eutectic E;, between B and C. The difference between this and the previous 
figures +) is that the extensions of the tie lines between the compositions of the 
two solid phases and the liquid phase generally do not run through C and that 
these tie lines do not start or end in one of the binary systems. 

Because of the occurrence of the PTS, and the PTS, there may be points of 
intersection between the PETS and/or the PTL and these curves. 

For the binary boundary systems A-C and B-C this means that peritectic 
three-phase equilibria (L $5 S, £5 S) must also be taken into account, and 
not only in these binary systems but also within the ternary system. 


` Fig. 3. Composition triangle with the projection of the three-phase strip and the PETS. The 
PTL is situated between the PTS, and the PTS;. 
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3.1. The projections of the effective tie lines are situated between the projections 
of the real tie lines of the equilibria LS S, and LS S; 

This situation is practically identical to that of the simplified systems dealt 
with previously‘) and all that has been said about the TL with respect to 
tangency, curvature, inflection, temperature extremes, etc., also applies here. 
~ Although the PTL in most of these cases will be situated between the PTS, 
and PTS, (see fig. 3), that is not always necessarily the case. In fig. 4 the PTL 
intersects the PTS, twice. Nevertheless, the TL is situated in a eutectic trough 
and the PETS is everywhere situated between the PTS, and the PTS,. 


Fig. 4. Composition triangle with the projection of the three-phase strip and the PETS. The 
PTL intersects the PTS, twice but the PETS remains complete. 


3.2. The projections of the effective tie lines are not situated between the PTS, 
and the PTS, 


Two possibilities for such a situation are given in fig. 5. In both cases the 
PETS, or better, the curve of intersection points Q,, of the tangents to the PTL 
at points P, and the extensions of the projections of the tie lines (G,D;) between 
the two solid phases in equilibrium with a liquid of composition P, can be 
constructed, as also can the lines P,Q,. However, if a liquid of composition P, 
(fig. 5) is solidified a solid « phase of composition D, will grow from it. Con- 
sequently, the composition of the liquid moves off the PTL in a direction op- 
posite to the direction P,D,. i 

The same applies to the situation in fig. 5b if « is replaced by f and D, by C;. 
Thus, if the TL is situated on a peritectic ridge *) there is no real ETS and there- 


*) In literature the term "melting gutter" is often encountered. This term is also used, 
especially in German literature (Schmelzrinne) for a TL situated on a peritectic ridge. 
We prefer the expression “peritectic ridge” in this case. 
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B 


Fig. 5. Composition triangle with the projection of the three-phase strip. The PTL is not 
situated between the PTS, and the PTS,. 

(a) The PTS; is situated between the PTL and the PTS,. (6) The PTS, is situated between 
the PTL and the PTS. 

In neither case is the constructed PETS (dashed curve P, Q,P2) situated between the PTS, 

and the PTS, and therefore it has lost its significance. 


fore no real PETS either. That is why we have used dashed curves for the con- 
structed PETS curves in fig. 5. This does not mean, however, that these curves 
or the dashed lines P,Q, are devoid of meaning. This can be easily seen if we 
start with a composition in which not only the liquid phase is present but also 
the « and/or the f phase in equilibrium with that liquid of composition P, 
(fig. 5a). If under these circumstances the temperature is decreased by an 
amount óT, the composition of liquid remains on the PTL as long as the equi- 
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librium is a three-phase equilibrium, i.e. the liquid behaves as if an infinitesimal 
amount of a solid with the average composition Q, has solidified and therefore 
its behaviour can be described with the aid of the curve P, Q,P; and the lines 
P,Q,. For this reason we call P, Q,P; a virtual PETS and the lines P,Q, projec- 
tions of the virtual effective tie lines. This does not mean that the liquid whose 
composition is situated on a virtual PETS always solidifies peritectically. That 
depends on the amounts of the three phases present in the equilibrium and on 
the curvatures of the PTL, the PTS,, and the PTS, 23). During the solidification 
process the composition of the liquid remains on the PTL until one of the solid 
phases has disappeared. The length of the path along the PTL followed by the 
liquid during the solidification process depends on the amounts of the solid 
phases present at the beginning of the process. Because our considerations are 
restricted to the solidification reaction of liquids whose composition is situated 
on the PTL, while no solid phases are present, the virtual parts of the PETS 
will not be shown in the subsequent figures. 


3.3. The projections of the effective tie lines are only partly situated between the 
projections of the real tie lines of the equilibria LS S, and LS S; 


For this situation there are a number of possibilities several of which will be 
considered. 


3.3.1. The binary boundary system A-C is of the peritectic type and 
the binary boundary system C-B is of the eutectic type 


We will consider two possibilities: 
(1) the temperature along the TL is continuously decreasing from the peritectic 
between A and C to the eutectic between C and B (fig. 6a); 
(2) the temperature is continuously increasing ‘along the TL in that direction 
(fig. 6b). 


In case (1) the PTL (P,P;P4P4E;) intersects the PTS, in P; but this is not 
an interesting point on the PTL because the projection of the effective tie line 
at this composition is generally not situated between the projections of the real 
tie lines P,G3 and PD, (see fig. 6a). This also applies to the compositions in 
the neighbourhood to both the left and the right of P, on the PTL. The inter- 
esting point of the PTL is P}, where the projection of the effective tie line, P;Q3, 
coincides with the projection of the real tie line P3D, (Q3 = D4). This means 
that solidification of a liquid whose composition is situated on the PTL between 
P, and P, starts with the formation of the f phase, although these liquids can 
also be in equilibrium with the « phase (peritectic-like solidification). The solid- 
ification of liquids whose compositions are situated on the PTL between P, and 
E, is eutectic-like, because here, both phase « and phase f grow from the liquid. 
The projection of the peritectic ridge P,P; passes at P; into the projection of the 
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Fig. 6. Composition triangle with the projection of the three-phase strip. The PTL intersects 
the PTS; giving rise to an incomplete PETS. 

(a) Situation for the case in which the temperature decreases continuously from the peritectic 
between A and C to the eutectic between C and B. (b) Situation for the case in which the 
temperature increases in that direction. 


eutectic through P4E;. (As remarked in the previous section, the solidification 
is only peritectic between P, and P, and eutectic between P, and E, if only the 
liquid phase is present. If this is not the case, also here again a reaction actually 
takes place which type of solidification depends on the amounts of the solid 
phases present in the system and on the curvatures of the PTL, PTS, and PTS,.) 


In case (2) nearly the same applies but, owing to the increasing temperature, 
P is situated between P, and P; while in case (1) it was situated between P, 
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and £,. A real ETS and hence a real PETS are thus only associated with the 
part of the TL situated in a eutectic trough, which is tangent to the PTL in E; 
for case (1) but not for case (2). 


3.32. Both binary boundary systems A-C and B-C are of the 
eutectic type 


In a ternary system of this kind it is also possible for a part of the TL to be 
situated on a peritectic ridge. It is not even necessary for the PTL to intersect 
one of the PTS curves. 

This situation is drawn in fig. 7. The parts of the PTL, £,P,P, and P4P4E, 
are projections of the parts of the TL situated in a eutectic trough while the 
projection of the part of the TL situated on the peritectic ridge is given by P,P3; 
£,0,Q, and Q30,E, are the two real parts of the PETS. The last part is tangent 
to the PTL in E, (because Tg, > Tz,). No new viewpoints occur if the PTL 
intersects one of the PTS curves twice. 


IN 
IN 
vh yS 


/\ 
/ FN 


Fig. 7. Composition triangle with the projection of the three-phase strip. Although the PTL 
does not intersect the PTS, or the PTS, the PETS is incomplete and consists of two separated 
parts. 


3.3.3. Both binary boundary systems A-C and B-C are of the 
peritectic type 


Here again we will deal with two possibilities: 
(1) the PTL does not intersect a PTS curve, or intersects one of these curves 
twice; : 
(2) the PTL intersects both PTS curves. 
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(1) The first case is sketched in fig. 8a, where parts P,P, and P,P, of the 
PTL are projections of the parts of the TL situated on a peritectic ridge and 
the part P4P4P, is the projection of the part of the TL situated in a eutectic 

trough. To this part a real PETS, 0;0,Q;, can be constructed. 
.. It might be thought that such a situation is only possible if the binary bound- 
ary systems assume rather unusual shapes. In fig. 8b, however, where we have 
drawn a system of this kind in perspective, the three binary boundary systems 
are seen to have very common shapes while the TL, R, R;R5R,, is situated partly 
on a peritectic ridge, viz. between R, and R, and between R, and R,, and partly 
in a eutectic trough, viz. between R, and R3. 

In the composition triangle the PTL is given by P,P, the PTS, by G,G;, 
the PTS, by D,D, and the real part of the PETS by 030,05. The fact that 
in some of the sections through C perpendicular to the composition plane the 
intersection curve with the liquidus surface ToR R R4Tc may exhibit a tem- 
perature maximum has no special significance, because the compositions of the 
solid phases in equilibrium with a liquid whose composition is situated in such 
a maximum are not normally situated in a section of this kind. 

In the case we considered above it is also possible that the PTL intersects the 
PTS, twice but again this does not introduce new points of view. 


(2) The second case is sketched in fig. 9. Here the PTL is the projection of a 
peritectic ridge between P, and P, and also between P, and P,; between P3 
and Ps it is the projection of a eutectic trough. The PETS belonging to part 
P,P; of the PTL is given by Q50Q,0;. 


p, Te, 


D 
Aa LN M ENGEL. 
A I B 
Fig. 8a. Composition triangle with the projection of the three-phase strip. Although the PTL 


is not situated between the PTS, and the PTS,, a part of the TL is nevertheless situated in a 
eutectic trough giving rise to an incomplete PETS (030,05). | 
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Fig. 8b. Perspective drawing of the situation in fig. 8a. 


4. Eutectic-like and peritectic-like reactions in ternary systems containing one 
liquid phase and two incompletely separated solid phases a, and a, (partial 
miscibility) 

(a) If a ternary system comprises only one liquid phase and one of the binary 
boundary systems contains two solid phases, giving rise to a binary phase 
diagram at a constant pressure of the eutectic or peritectic type but the 
solid components in the other two binary boundary systems are completely 
soluble in each other, we have an incomplete three-phase strip. 

In the first-mentioned binary boundary system a three-phase equilibrium 
starts between one liquid and two solid phases which has disappeared before 
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A 


Fig. 9. Composition triangle with the projection of the three-phase strip. The PTL starts in 
one of the binary peritectic compositions and ends in the other one, intersecting the PTS, 
as well as the PTS,. 


one of the other two binary boundary systems is reached. Because we have 
assumed that no compound is formed within the binary system, one phase 
disappears somewhere in the system without a new phase being formed. 
As the liquid phase exists in the whole system this is only possible if the two 
solid phases become identical somewhere in the system. If an isothermal 
section is made through the system below the lowest temperature at which 
the liquid can exist, solid solutions of the three components are found ex- 
cept for compositions situated within a demixing loop. The three-phase 
equilibrium (1 liquid 4- 2 solids) also shows such a demixing loop for the 
two solid phases. In the case under consideration the three-phase strip starts 
in the binary boundary system which is of the eutectic or peritectic type. 
Starting at this system and going along the three-phase strip inside the 
system a point will be reached in which the compositions of the two solid 
phases in equilibrium with the liquid have become exactly the same and in 
which the difference between the solid phases has disappeared. This is the 
last point of the three-phase equilibrium. Beyond that point no three-phase 
equilibrium exists because the system becomes bivariant at such a point. 

(b) It is also possible that the components in the three binary boundary systems 
form solid solutions in every proportion, but that there is a demixing loop 
for the two solid phases in isothermal sections through the system at tem- 
peratures at which no liquids can exist. In this case the demixing loop is a 
smooth closed curve. The three-phase strip of the equilibrium between the 
two solid phases and the liquid phase is again incomplete and contains two 
points where the difference between the two solid phases disappears. 
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As proved elsewhere ?) the projection of the effective tie line that connects 
the composition of a liquid at such a point (where the two solid phases of 
the three-phase equilibrium LS S, £5 Sg become identical) with the co- 
inciding compositions of the two solid phases must be tangent to the PTL. 


4.1. Three-phase equilibria (L €5 S, © Sg) with a one-sided closed demixing loop 
for the two solid phases in the three-phase strip 


In this group of equilibria one of the binary boundary systems contains a 
three-phase equilibrium of the eutectic or peritectic type and the three-phase 
strip contains one situation in which the difference between the two solid phases 
has disappeared (the ultimate composition). We will examine some possible 
situations. In fig. 10 it is assumed that the TL as a whole is situated in a eutectic 
trough. This means that the projections of the effective tie lines P,Q, are always 
situated between the projections of the real tie lines P,G;,.,. The length of the 
tie lines D,G, between the solid phases in equilibrium with the liquid phase is a 
function of the composition of the liquid on the PTL. In the ultimate com- 
position of this liquid the difference between the two solid phases has dis- 
appeared. This means that the projection of the tie line between the two solid 
phases has become transformed into a tangent (/ in fig. 10) to the smooth curve 
G,G,G;...D3D,D, in the coincident compositions Gs and D, which is the 
end point of the PETS (Q4). The projection of the effective tie line, P4Q4, co- 
incides with the projections of the two real tie lines between the ultimate com- 
position of the liquid and the compositions of the two solid phases, i.e. there 
is only one real tie line whose projection is tangent to the PTL at P,, the 


Fig. 10. Composition triangle with the projection of the (incomplete) three-phase strip, if the 
components form solid solutions in every proportion in two of the binary boundary systems. 
E,P, is the projection of an incomplete TL situated in a eutectic trough; E104... Qs is 
the PETS belonging to this TL. 
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terminal point of the PTL in accordance with what has been said above. The 
PETS is given by E,010503Q.. 

It is also possible that the PETS starts in the eutectic composition but inter- 
sects the PTS loop before the liquid in equilibrium with the two solid phases 
has reached the ultimate composition. This situation is shown in fig. 11. Here 
£,P,P,P3P, is the PTL. The part of the TL belonging to E P,P, is situated 
in a eutectic trough. The part of the TL belonging to P,P3P, is situated on a 
peritectic ridge. 

E,Q4,0; is the real part of the PETS and Q5Q4Q, the virtual part (drawn as a 
dashed curve). P,Q, is tangent to the PTL at P4. In this case Q4 (= D; = Gs) 
is a singular point of the PETS because it is a real point. If, as in fig. 11, the last 
part of the PTL belonging to P,P, is situated as a whole on a peritectic ridge, 
it follows from the construction of the PETS that the virtual part of it is tangent 
to the PTS curve at Q,, which coincides with the coinciding compositions of the 
solid phases a, and «z, Gs and Ds, respectively. The projection of the tie line 
between these coinciding compositions (/ in fig. 11) is therefore also tangent to 
the virtual PETS at this point Q4 (= Ds; = G;). We think this is not a situa- 
tion that will often be met in practice, and believe there is a greater chance 
that the last part of the TL in the neighbourhood of the ultimate composition 
of the liquid will be situated in a eutectic trough again as sketched in fig. 12. 
Here E,P, is the projection of a part of the TL situated in a eutectic trough and 
EQ, is the real part of the PETS belonging to it. P,P, is the projection of the 
part of the TL situated on a peritectic ridge and Q, Q, is the virtual part of the 


C 


A 


Fig. 11. The situation resulting in the case of fig. 10 if the PETS changes from real into 
' virtual. The ultimate point is Q4 if the PETS is real again. , 
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Fig. 12. A similar situation as in fig. 11, but more probable. The PETS has two real parts, 
£,Q, and Q2Q3, and one virtual part Q4Q;. 


PETS belonging to it. P,P is the projection of a part of the TL situated again 
in a eutectic trough and Q;Q the real part of the PETS belonging to it. In the 
limit case P; and P, and thus also Q, and Q, coincide, and the situation of 
fig. 11 is obtained. 

If the TL starts in the binary boundary A-C system as a peritectic, the situa- 
tion of fig. 13 will normally be obtained, in which the last part of the TL is 
situated in a eutectic trough (P,P3) whilst the other part is situated on a 
peritectic ridge (P,P,P;). In the limit case for P; coinciding with P3, the real 
part of the PETS, Q,Q3, has shrunk to a point and the TL is situated as a 
whole on a peritectic ridge. The virtual PETS is tangent to the PTS at Q, in 
this case, in accordance with what is said in connection with fig. 11. 

Of course it is possible to consider a number of other situations, but with the 
aid of what has been said in this section it will be easy to draw them. 


4.2. Three-phase equilibria (LS S, € Sg) with a two-sided closed demixing loop 
for the two solid phases in the three-phase strip 


If in a ternary system the two components in each of the three binary bound- 
ary systems form solid solutions in every proportion at temperatures where no 
liquid can exist there may be a three-phase strip inside the ternary system. The 
projection of such a three-phase equilibrium (L £5 S, £5 S,) on the composi- 
tion plane consists of a two-sided closed demixing loop of the two solid phases 
and a curve giving the compositions of the liquids in these three-phase equi- 
libria (the PTL). 
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C 


A B 


Fig. 13. The boundary A-C binary system is of the peritectic type. The part P, Q2 of the 
PETS is virtual, the part Q2 Q3 is real, except if Q, and Q, coincide. In that case the PETS 
has only one real point, Q3. 


Such a situation is sketched in fig. 14. In this figure P,P,P.P3P, is a PTL. 
The part P,P,P, belongs to a eutectic trough, the part P,P, to a peritectic 


C 


A B 
Fig. 14. The projection of the three-phase area in the case of a ternary system, containing a 
two-sided closed demixing loop of the solid phases. In the chosen situation the PETS be- 
longing to the incomplete PTL consists of two real parts Q1Q; and Q5Q and one virtual 
part Q2Q;. 
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ridge and the part P4P, to a eutectic trough again. P,Q, is tangent to the PTL 
at P,, P,Q, is tangent to the PTL at P2, P3 Q3 is tangent to the PTL in P5, and 
P,Q, is tangent to the PTL in P4, entirely in accordance with the rules given 
above. 0,Q, and Q3Q, are real parts of the PETS and Q,Q, is the virtual 
part of it. Here too, many other situations are possible. 


5. Closing remarks and conclusions 


It has been demonstrated that the PETS is a useful curve in describing solid- 
ification processes in ternary systems. In the systems dealt with in this paper the 
real part of the" PETS is always situated between the projections of the two 
solidus curves which means that it starts and ends in binary eutectic composi- 
tions or in compositions situated on one of the projections of the ternary 
solidus curves. : 

The examples of thé ternary systems given in the preceding sections are not 
the only possibilities of the ternary systems dealt with in this paper. Other types 
of system are possible. We have not considered the cases in which the TL ex- 
hibits a temperature extreme or in which the PTL shows inflection points. The 
interested reader will be able to deal with other possible ternary systems with 
the aid of the considerations given in ref. 1 and in this paper. The examples 
given were intended to show the possibilities of eutectic-like and peritectic-like 
reactions in a number of ternary systems even if the three binary boundary 
systems themselves do not exhibit peritectics and/or eutectics. 


Eindhoven, November 1973 
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INTRODUCTION AND REMOVAL OF 
HYDROXYL GROUPS IN VITREOUS SILICA 


PART III. SOME THERMODYNAMIC DATA ON THE 
REDUCTION OF VITREOUS SILICA 


by G. H. A. M. van der STEEN 


Abstract 


On the basis of the data given in this and earlier published papers t?) it 
can be concluded that hydrogen dissolves in vitreous silica according to 
a reaction of the following type: 


28i0; + H3 = =SiOH + =SiH 
T 
where = SiOH :: Si0,s0H:: $-Si-oH. 
e 


with 
AG 1500-1800 x = 34500 + 1425 T. (cal). 


The change in Gibbs free energy seems to be slightly dependent on the 
glass-preparation technique. A linear relationship is found between the 
OH mole fraction and the square root of the hydrogen partial pressure. 
Bubble formation during the cooling of a melt can be explained with the 
aid of these thermodynamic data. The results of experiments where 
H2 (D2)/H20 (D20) mixtures were used during melting, support a 
reaction mechanism according to which equimolar amounts of =SiOH 
and zSiH groups are formed. 


1. Introduction 


Some data have been published on the thermodynamics of the reduction of 
(vitreous) silica by hydrogen ?:*:5:7). Most investigations on the reduction of 
silica concern the reaction ?:*:5) 


SiO; + H, > SiO + H,O. (1) 
cryst. gas gas gas 
This reaction seems to have a strongly endothermic character. According to 
Ramstadt 5): ; 
AG? 1700-1900 x = 127100 — 45-07 T (cal). 


Hydrogen can be introduced in the form of =SiOH groups and =Si-H cen- 
tres ?) according to the reaction 
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2 SiO, + H;  zSiOH + zSiH (2) 
glass : 
-e- 
I 
where = SIOH :: Si0,50H:: $-Si-oH. 
©- 


The vitreous-silica network is considered to be the solvent for the hydrogen- 
containing centres. If the gaseous reaction products from reaction (1) are dis- 
solved in the glass network, accompanied by some rearrangement reaction (or 
complex formation), the net result will be the same as for reaction (2). The 
difference between the Gibbs-free-energy changes of reactions (1) and (2) is 
attributable to the heat effects and entropy changes of the following reaction: 


H,O + SiO + SiO; — =SiOH + zSiH. (3) 
gas gas glass 


This work is concerned with the study of reaction (2). A phenomenon like 
bubble formation during the cooling of a melt can be explained with the aid 
of the data found. 

Another reaction which introduces hydroxyl groups into vitreous silica is 
the reaction between water and the glass: 


2 SiO, + HO — 2 zSiOH. (4) 
glass gas 


This reaction has been studied by Moulson and Roberts ê). Reaction (2) com- 
bined with reaction (4) can be used to demonstrate the existence of —SiH-like 
centres. 


2. Experimental 


2.1. Preparation of the samples 


Milled and rinsed Brasilian-rock crystals were melted in a molybdenum 
crucible at 1950 °C in an H,/He atmosphere. A more detailed description of 
the raw material and melting procedures has been given in an earlier paper *). 
Plates of about 3 mm thickness were cut from the glass ingot and polished 
bilaterally. í 

At the melting temperature (1950 °C) equilibrium OH concentration is not 
reached. The penetration of hydrogen into the melt is probably hampered by 
a reaction at the surface of the melt +). At lower temperatures (< 1550 °C) 
the kinetics of this reaction has become negligibly small and equilibrium con- 
ditions for reaction (2) can be reached. In order to establish equilibrium con- 
ditions, the vitreous-silica plates were fired for 16 hours in an electrically heated 
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vertical-tube furnace (PCA 10/10 Lab furnace), at the desired temperature and 
hydrogen partial pressure. The hydrogen partial pressure was changed by 
mixing H, and N, in a mixing apparatus. The use of several accessories made 
it possible to quench the samples in the same atmosphere as that in which 
they were fired. 

When the samples were heated at temperatures between 1200 and 1500 °C, 
the surface of the glass plates was usually recrystallized and therefore the 
samples had to be repolished. 

The intensity of the 2-73-um absorption band was measured with the aid 
of a Jasco-IRA 2 double-beam spectrophotometer. 

In determining the OH mole fraction (xon) with the aid of the 2-73-um ab- 
sorption band, the practical molar extinction coefficient, which relates the 
optical density per mm (Soy value) to the hydroxyl concentration, plays an 
important role. In a previously published work ?) we have described an 
extraction method for determining the extinction coefficient, a value of 
58 1 mole^! cm=! is found for ¢9, and therefore: 


10 Bou Msio 
xou — — — 5 (5 
ton Ügiass 
where xoy = mole fraction OH, 

Msio; = 60-1 g/mole, 

Csinss = 2200 g/litre, 

Eoy = 58 litre/mole cm, 

Bou = optical density per mm. 


3. Results and discussion 


3.1. OH concentration in vitreous silica as a function of the H, partial pressure 


The vitreous-silica plates were fired at 1550°C in an H,/N. mixture for 
16 hours and quenched. After repolishing, the Boy value was measured 
(Bou value = optical density per mm). 

Real equilibrium conditions should be independent of the path followed. 
To check this, two types of samples were used. 

Type a: Freshly molten samples (15 min, 1950 °C, 100% H;, quenched) with 
a Bou value higher than the equilibrium Boy value. 

Type b: Samples fired for 24 hours in vacuum at 1050 °C. This treatment 
reduces the Bo, value to practically zero. 

Figure 1 indicates whether the initial OH concentration was higher (O) or 

lower (X) than the equilibrium concentration. A linear relationship is observed 

between the square root of hydrogen partial pressure and the OH concentra- 

tion (Yon = 2-17 . 1074 Vpn, at 1550 *C). 
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Fig. 1. Mole fraction of hydroxyl as a function of the square root of the hydrogen partial 
pressure. O: initial OH concentration higher than equilibrium concentration; x: initial 
OH concentration lower than equilibrium concentration. 


3.2. Standard Gibbs-free-energy changes for the reduction reaction 

In studying the temperature dependence of the reaction, nearly the same 
procedure was followed as in studying the pressure dependence, except for the 
fact that in this case the hydrogen pressure was held constant (1 atmosphere) 
while the temperature was changed. Assuming that reaction (2) is valid, the 
following relationship will hold: 


(6) 


x x 
AG? = —RT In K = —RT In (mimi) 


Ysio;? Xsio;^ f Pn; 
where 

Ysio5: You: Ysin are the activity coefficients of the various reactants, 

f is the fugacity coefficient for hydrogen, 

Xsio.» Xom Xsin are the mole fractions, 

AG? is the standard Gibbs-free-energy change for reaction (2), where one mole 
hydrogen has reacted to =SiOH and =SiH groups in the form of a hypo- 
thetically ideal solution. 

When the fugacity coefficient f is taken unity and Yom Ysin» Ysio, = 1 for 

Xsion» Xsin — 0, with xon = Xsin, and xgo, assumed to be constant, equation 
(6) reduces to 
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Xon? 
AG? = —RT In : (7) 
Puz 


In fig. 2, 4G? is plotted as a function of temperature. The best-fitting straight 
line gives 
4G? = 34500 + 14-25 T (cal) (glass 1). (8) 


The reaction thus seems to be endothermic, with a negative entropy change 
(AG? = AH? — T AS?). The negative entropy change is probably mainly due 
to disappearance of the gaseous hydrogen. 

The only known thermodynamic quantities for hydroxyl groups in vitreous 
silica were reported by Moulson 5) and Bell ?). Moulson studied the reaction be- 
tween water and vitreous silica (reaction (4)) and found 4H; = —6 kcal/mole. 
Bell investigated the reaction between hydrogen and vitreous silica. The heat 
of solution found by Bell (4H; = —2 kcal/mole) differs considerably from 
our own values and even the sign of the heat effect is different. This is prob- 


60 


AG (kcal) 


50 


20,500 1500 2000 2300 


—-T(K) 


Fig. 2. Standard Gibbs-free-energy changes as a function of temperature for the reactions 


(A) 2SiO; + H2 > =SiOH + =SiH (x: glass 1, e: glass 2), 
glass gas diss. , diss. 


(B) SiO, + H: — SiO + H0 (dash-dotted). 
glass gas gas gas 


(Reaction (B) according to Ramstadt 5), corrected for the crystalline-glass transition.) — , 
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ably due to the fact that Bell derived the reaction enthalpy from only two 
measuring points (800 and 1050 °C). According to our experiments, the equi- 
librium OH concentration is very low in this temperature range and very 
difficult to measure accurately. 

Between 1700 and 1850 K, thermodynamic data are obtained for reaction (1) 
(AG° = 127100 — 45-07 T) 5) and for reaction Q) us = 34500 + 14-25 T 
(this paper)), see fig. 2. 

In the study of reaction (1), crystalline SiO, powder was used. The thermo- 
chemical properties of the vitreous state differ slightly from those of the 
crystalline state. This can be compensated for as follows: 


SiO, +H, —SiO + H,O, AG? = 127100 — 45-07 T 
cryst. gas gas gas 


SiO, — SiO;, AG? œ —2900 + 1:20 T 
glass cryst. 


SiO; +H, -— SiO + H,O, 4G? ~ 124200 — 43:87 T (9) 
glass gas gas gas 

The arrangement used in studying reduction reaction (2) (firing a vitreous-silica 
plate in a continuous H, flow in an electrically heated furnace) can be considered 
as a closed system for this reaction. However, the system is not a closed one 
with respect to reaction (1). Taking this into account in conjunction with the 
fact that the standard Gibbs-free-energy change is less positive for reaction (1) 
than for reaction (2), one would expect gaseous SiO and H,O to be formed 
rather than OH groups. 

A consideration of the rate-controlling processes may explain why never- 
theless a quasi-equilibrium OH concentration is reached. Hydrogen penetrates 
into vitreous silica at a rate approximately 1000 times greater than the rate at 
which H,O moves within the glass. Within the plate, the hydrogen reaction 
will be accompanied by the formation of physically dissolved H,O and SiO. 
The rate of removal of these products will be considerably slower than the rate 
of production. Therefore, in practice, the reactions that occur in the bulk will 
hardly be affected by the composition of the surrounding atmosphere (except 
for the H, partial pressure). 

The formation of OH and SiH groups may therefore also be considered 
as a result of the next reaction: 


SiO + H30 + SiO, — =SiOH + =SiH 
gas gas glass 


(10) 


AG? = —89700 + 58-12 T (cal). "s 
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3.3. Bubble formation when cooling a melt 


Another interesting relationship is that between log Boy and the reciprocal 
temperature: 
2 
d In K 2 d In xog -— d log Bou l 12) 
d(/T) — d(1/T) d(1/T) 
Log Box as a function of 1/T is shown in fig. 3. With the aid of this figure one 
can explain the interesting phenomenon of bubble growth when a melt is slowly 
cooled. 

In a previous paper !) it was shown that, when a sample was melted in a 
hydrogen atmosphere at 1950 °C and then cooled down to 1700 °C and held 
at this temperature for 15 minutes, a great number of hydrogen-filled bubbles 
was formed. 

It was also shown that at 1950 °C the equilibrium OH concentration is not 
reached, due to a fast evaporation process at the surface of the melt. The total 
amount of hydrogen present in the glass is mainly determined by íhe amount 
that is included during heating, sintering and melting. A typical Boy value for 
a freshly melted sample comes to 90. 107? mm-^!. It can be seen from fig. 3, 
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Fig. 3. Boy as a function of reciprocal temperature, for samples treated in a hydrogen 


atmosphere. 'The shaded area indicates the temperature range where bubble growth could 
be detected when slowly cooling a melt. 
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point A, that this value is considerably lower than the extrapolated equilibrium 
Bou value (about 120. 107? mm"). When the sample is cooled down, the equi- 
librium OH concentration is reached at 1810 °C (point B). During further 
cooling, SiOH and SiH groups will react and physically dissolved hydrogen 
will be formed until local equilibrium pressure is reached. If the reaction is not 
frozen in, the H, pressure will exceed atmospheric pressure and bubble growth 
will take place. 
Bubble-growth kinetics is proportional to the pressure difference 


AP = Prouvvie 77 4 surrounding 


and inversely proportional to the viscosity (7) of the medium in which the 
bubbles grow. During cooling, AP as well as 7 will increase. When y has 
become too high, bubble growth will stop. In fig. 3 the shaded area indicates 
the temperature range in which bubble growth could be detected. 

At lower temperatures, long times for bubble growth are necessary. There- 
fore the diffusion out of the glass mass will become considerable and con- 
sequently the geometry of the melt will become an important factor. 

When a melt is cooled down quickly, bubble formation can be prevented. 
It is not yet certain whether the failure of bubbles during rapid cooling is 
attributable to relatively low reaction kinetics or to low bubble-growth kinetics, 
or to both. 


3.5. Influence of the melting conditions on the thermochemical properties of the 
glass 


A bubble-free vitreous-silica sample was prepared by melting the raw material 
in a hydroxyl-free evacuated vitreous-silica ampoule, in an arrangement com- 
parable with the arrangement used to melt the samples in an H,/He atmos- 
phere. Hydrogen and water were avoided during melting and the molybdenum 
melting crucible was protected by an argon atmosphere. 

The sample was fired for 16 hours at 1050 °C in vacuum. The foy value of a 
sample thus treated was found to be 146.107? mm 7+. The “stable” OH 
groups originate from the raw material itself +). When the sample was fired 
in an H, atmosphere the Boy value increased. Table I gives the Boy value after 
the various treatments of the sample. The amount of “stable” OH groups 
decreases a little after every heat treatment. This is probably due to a slow 
diffusion of molecular water out of the glass plate. 

In the calculations made to determine the equilibrium constant K for reac- 
tion (2) the total OH concentration is taken into account and 


Xsin = Xoy — Xon(Stable), 


Xon(stable) = 4 (xou(stable) before reacting + Xoy(stable) after reacting). 
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TABLE I 


consecutive treatments after 


-1 3 
melting in vacuum Pon; value anm: "130 


16 hours 1050 °C/vacuum 15 
16 hours 1550 °C/H, 90 
16 hours 1050 °C/vacuum 12 
16 hours 1317 °C/H, 54 
16 hours 1050 °C/vacuum 10 
16 hours 1390 °C/H, 64 
16 hours 1050 °C/vacuum 8 


Figure 2 gives a plot of AG? as a function of the temperature (glass 2). The 
standard Gibbs-free-energy change differs slightly from the standard Gibbs- 
free-energy change for the same reaction with samples prepared in an H, 
atmosphere. The measurements are not accurate enough to enable one to de- 
termine whether this difference is to be attributed to an enthalpy and/or to an 
entropy effect. A possible explanation is that, contrary to the crystalline 
materials, the vitreous state is not defined exactly. The structure of the network 
on a molecular scale will be influenced by e.g. the preparation method. This is 
reflected in the physical properties of the glass and probably also explains the 
small difference in standard free-energy change for the reactions. 


3.5. Melting silica in an He/H, (D;)/H5O (D20) atmosphere 


The difference in stability between OH groups that are introduced accord- 
ing to reaction (2) or (4) can be used in order to demonstrate the existence 
of SiH centres as follows. 

Vitreous-silica samples were prepared by melting the raw material in an 
H,,H,0,He atmosphere. The H,,He mixture (H, 50%, He 50%) was wetted 
by leading the gas through a water bath. The temperature of the water was 
held constant at 26°C. The wetted gas was led through a vertically placed 
coiled condenser (length 1-5 m). The condenser was held at 21 °C. A dew 
point of 21 *C corresponds to a water partial pressure of 0-025 atmosphere. 
Table II gives the oncon) values of the glass samples after the various treat- 
ments. 

When during melting, at a time when it was sure that the melt was formed 
already, the composition of the gas mixture was changed (e.g. for 5 minutes), 
no perceptible change of the Borcop) values could be detected. This means 
that within the period of time that a melting experiment lasted, the exchange 
between the gases within the melt and the surrounding atmosphere was neg- 
ligibly small. In an early stage of the sintering and melting processes the sur- 


j 


m _ | 
iy 


318 G. H. A. M. VAN DER STEEN 


TABLE II 


Values of Boy and Bop, before and after firing the same samples for 16 hours 
at 1050 °C in vacuum 


after 16 hours 
1050°C in vacuum 


[om [me [me [| 
OS Sel) cee 78 = 


melting conditions freshly molten 


rounding atmosphere is enclosed between the grains and the mutual exchange 
between the gases in the melt and the surrounding atmosphere almost stops 
completely +). Therefore it is likely that the He/H, (D2)/H20 (D20) ratio of 
the gas that is trapped by the sintering grains will have the same composition 
as the surrounding atmosphere. 

The effect of water addition on the sintering properties of the raw material, 
and thus on the total amount of gases that is enclosed during melting, is not 
known quantitatively. Therefore it is not advisable to compare the Boy values 
of samples 1 and 2 (or 3 and 4) directly. It is better to make use of the (apparent) 
difference in stability of the OH groups that are introduced according to reac- 
tions (2) and (4). When sample 1 was fired for 16 hours at 1050 *C under 
vacuum, the Boy value was reduced to practically zero. When sample 2 was 
fired in the same way, the Boy value was reduced to 12. 107? mm-!. This 
value decreased very slowly when the sample was fired for several hundreds 
of hours. 

The ratio OHgtabie/OHunstavie (sample 2) is 12/94 = 0-13. The H;O/H, ratio 
of the surrounding atmosphere is 0:025/0-49 ~ 0-05. When it is assumed that 
one H, molecule produces one OH and one SiH group the expected ratio 
between stable and unstable OH groups will therefore be 0-10, a value which 
is quite well in agreement with the experimental result (0-13). 

In order to exclude the influence of small amounts of OH groups that are 
carried along with the raw material, D,,D,0,He mixtures were used (samples 3 
and 4, table IT). 

Because of the fact that vitreous silica itself possesses a fairly strong ab- 
sorption band in the 3-7-um region the intensity of the OD absorption band 
was measured by placing an OD free sample of the same thickness in the 
reference beam of the spectrophotometer. The ratio OD stapie/ODunstavie (sample 
4) was 0-11, a value which is in agreement with the ratio (0-10). 


m 
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4. Conclusions 


(1) A linear relationship is observed between the equilibrium OH concentration 
and the square root of the hydrogen partial pressure. 
(2) Hydrogen reacts with vitreous silica according to the reaction 


2 SiO, + H, 2 zSiOH + zSiH. 


In the temperature range between 1500 and 1800?C the following rela- 
tionship was found: 


AG? = 34500 + 14:25 T (cal). 


(3) When glasses are used which are prepared in completely different ways, 
the standard free-energy change for the reduction reaction may differ slight- 
ly. 

(4) The LR. absorption data of samples prepared in a wetted and dry H, 
atmosphere support a reaction mechanism according to which an equi- 
molar amount of =SiOH- and =SiH-like centres is formed during the 
reaction between H, and vitreous silica. 
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SCRATCHING EXPERIMENTS ON VARIOUS CERAMIC 
MATERIALS 


by A. BROESE van GROENOU, N. MAAN and J. D. B. VELDKAMP 


Abstract 


Using diamond indenters of square pyramidal shape (apex angle 100°, 
120°, 136°) scratching experiments have been carried out on a number 
of polycrystalline ceramic materials: ZnO, Nio.36ZNo.64Fe204, 
Mno.57ZNo.35Fe2.0804, SrFe12019 and Al2O3, and on a lead glass. 
The ratio of the tangential to the normal force is found to depend 
strongly on the apex angle, but not on the material or on the load in 
the range of 0-05 to 1 N. This finding is approximately in agreement 
with the “ploughing” model of Bowden and Tabor for homogeneous 
and isotropic materials. The experiments give information on the friction 
in the interface between diamonds and sample. The friction force on the 
material has a component in the direction opposite to that of the 
scratching particle, thus against the direction of the flow of the removed 
material along the diamond. The energy needed for the creation of a unit 
volume of the groove, i.e. the specific energy, and the scratch hardness 
are independent of apex angle in the range used, but increase with the 
Vickers hardness of the material. Moreover, the specific energy decreases 
with increasing load for the harder materials. Tangential-force fluctua- 
tions are found at all loads used. The amplitude of these fluctuations is 
linearly related to the average tangential force. The region around the 
scratches is disturbed by cracks and chips. The width of this region is 
dependent on material and load. For fine-grained materials a square- 
root dependence on force is found, for coarse-grained materials a linear 
relation. 


List of symbols 


b width of groove 

b, width of chipped region 

d depth of groove 

e specific energy 

f force ratio = F,/F, 

Jod maximal force ratio 

n coefficient for Meyer hardness 


Pm dynamic flow pressure E 
u displacement 

[A speed of diamond 

A. contact area 

A, projection of A, on horizontal plane 

A, cross-section of a groove 

C, C' constraint factors 

Fi transverse force component 
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normal force component 
tangential force component 
average of F, 

peak-to-peak amplitude of F, 
Meyer hardness 

scratch hardness 

Vickers hardness 

radius of indenting sphere 
power needed for removal 
fiow stress 

rate of volume removal 
friction angle (arctan u) 
angle (arctan f) 
upward-flow factor 

angles  - 

half of apex angle 
Coulomb friction coefficient 
angle 

normal stress at interface 
shear stress at interface 
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1. Introduction 


1.1. Grinding and scratching 


Grinding of ceramics is a complex process, involving a machine, a grinding 
wheel with abrasive particles and the surface of a workpiece. The mechanical 
action of the abrasive results in heavy deformation of the surface of the work- 
piece, part of which is loosened and subsequently removed. Research on grind- 
ing is concerned with the quantitative analysis of such parameters as the rate 
of volume removal (Z), the normal and tangential components of the force 
(F, and F,, respectively) and the power (W) needed for removal. These macro- 
scopic quantities have, for example, been measured for alumina +?) and steel >). 
The energy needed per unit volume of removed material, the so-called specific 
energy, e, is given by e = W/Z (in the American literature one frequently uses 
the symbol U or ug, in the Annals of the C.I. R.P.^) one finds e). For steel ?:3) 
the specific energy decreases by more than a factor 10 when the average depth 
of cut of the abrasive particles is increased. At the same time the mode of 
material deformation varies from light rubbing to ploughing and then to heavy 
cutting. Is the action of each abrasive particle equally relevant for grinding of 
ceramic materials? The answer is found by observing the microscopic deforma- 
tions at the beginning of the contact between a grinding wheel and a slightly 
tilted sample 5). Figure 1 shows photographs of an MnZn-ferrite sample, ground 
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at 75 m/s. One sees that first a fine scratch is made at the start of the diamond 
track; further on, the track widens and chips appear; finally, the chips and the 
track merge with those made by other particles. From these observations one 
concludes that the study of single scratches on metals and ceramics is highly 
relevant for grinding. For single particle scratching at a speed v, the rate of 
volume removal Z is equal to A, v, where A, is the cross-section of the scratch. 


a) 


Fig. 1. (a) Top: Photograph of scratches on polycrystalline Mno.57Zno.35Fe2.9g04 made 
by a number of diamonds on an electroplated wheel, running at 75 m/s. Since the sample was 
slightly tilted, the start of the scratches could be made visible. 

(b) Details of fig. 1a, showing the very beginning of a scratch and a ground region where 
several scratches have merged. 
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Moreover, the power W is given by W= F, v, so the specific energy e follows 
from 
e = W/Z = FA, 


By measuring the tangential force and the dimensions of the scratch, one ob- 
tains the e value which can then be compared with grinding data. Once the 
interaction between scratches no longer can be neglected, one expects the e value 
from grinding to bé lower. Single-point experiments have been done on steel $7) 
and alumina ':57!*) at the wheel speeds used in practice (10-60 m/s); some 
qualitative work has been done up to 500 m/s by Kapernaros +?) on steel and 
glass. A disadvantage of these experiments is the lack of sensitivity in the force 
measurement; this led some workers to quite high values of the depth of cut 
(e.g. 8 um 7+9), 25 um ?:9)). In practice, however, the depth of cut is less than 
a micron in fine-grinding; values of 10 um may be reached in coarse-grinding. 

Most grinding experiments yield values of the normal force, F,, and the 
tangential force, F,. The force ratio defined by 


f= FF, 


is easily derived. As an abrasive friction coefficient, f is frequently used to 
identify a particular type of material behaviour +°), despite the complexity of 
the experimental results. For example in dry grinding of steel?) and alu- 
mina 7:19) the values of f decrease with increasing wheel speed. In wet grinding, 
however, f is roughly constant 9). 

Since both f and e are also easily accessible in low-speed single-point experi- 
ments, it is worthwhile to investigate if speed has drastic effects on the material 
deformation. Figure 2 shows scratches made at a speed of 1 m/s under a load 
of 0-1 N and 0:5 N. In comparison with fig. 1 the grooves are not essentially 
different in spite of a speed difference by a factor 108. A number of authors have 
reported on studies of slow scratches in the speed range between m/s and 
mm/s in various ranges of normal load: on glass +14) (0-2-6 N), GaAs t5) 
(107*-3.107? N), MgO +6) (1-10 N), various rocks !7—19) (0-5-6 N), fer- 
rites 20:21) (0-02-10 N), metals ??-?9) (1075-6 N) and tungsten carbide !?) 
(1-10 N), each author measuring F, or F, or groove dimensions, but usually 
not all of these. Except for diamond 27), the mechanical behaviour of these 
materials is similar at low loads, where a well-defined: groove is found. Diamond 
and, at higher loads, the "brittle" ceramics, show cracking and chipping out- 
side the grooves. We now turn to theoretical descriptions of these phenomena 
and to predictions of the force ratio and the specific energy. 


1.2. Ploughing 


Geometry and energy of the grooving are considered in the “ploughing” 
model: the shape of the scratching particle determines the dimensions of the 
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Fig. 2. MnZn-ferrite sample (shown in fig. 1); microstrukture (top left), slow scratch (0:1 N) 
(top right) and a scratch under a load of 0-5 N before (bottom right) and after (bottom left) 
etching. Apex angle 20 = 120°. 


groove; the energy needed is related to an average flow stress of the material 28). 
supposed to be homogeneous and isotropic. This model has been worked out 
for a number of geometries by Goddard and Wilman ?5) (reviewed in ref. 29) 
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in terms of average compressive and shearing stresses at the interface between 
indenter and material More recently Hisakado 7°) considered the same 
problems and used the Coulomb friction coefficient for the ratio of shear to 
compression. Shaw et al. 39:37) studied material flow underneath a sphere and 
found that the process has a resemblance to extrusion. For ploughing with a 
sphere Shaw's e value is proportional to the Vickers hardness 3°). This result 
emphasizes the close relation between hardness, specific energy and flow stress 
in the ploughing model. For the hardness, H,, is proportonet to the flow 
stress, Y: i 


H, = CY, 


where C is the (geometrical) constraint factor 31-37), An equivalent relation 
between specific energy, e, and flow stress is expected: 


e=C Y, 
but for this different geometry no estimate of C’ is available. 
The ploughing theory describes a number of experimental results quite well, 


e.g. for scratching with cones, pyramids and spheres. For a conical indenter 
(apex angle 20) the force ratio is predicted to be ?5:29) 


f = (jx) cot 6, (1) 


in good agreement with results on metals ??:25), For pyramids one has to 
specify which planes are in contact during scratching. For two leading planes, 
as shown in fig. 3b, one finds ?5) 


f = 4y2 (cot 0 + pu cosec 0), (2) 


where 4 is the Coulomb friction coefficient and 20 is the angle between two 
opposite planes of the regular pyramid. For a pyramid with one leading plane 
(fig. 3a) there are several possibilities. First, the two side planes are also in 
contact with the material and share the normal load; one finds ?9) 

cot 0 + u (1 + 2 cosec 6) 


PERR TTO c0 08 


3 Q- at- 


a) b) c) 


Fig. 3. Top view of the three types of diamond pyramids used. The dashed area is in con- 
tact with the workpiece, the arrow indicates the direction of grinding. Diamond 5 is the same 
as in a, but has been rotated through 45°. 
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which for u = 0 reduces to f= $ cot 0. Second, only one side plane is in 
contact, then ?5) 


f — X (cot Ó + u cosec 6). (4) 


Hisakado 2°) made scratching experiments with metals (steel on steel, steel on 
copper) for four @ values at speeds between 1 and 100 cm/s; he found good 
agreement with eq. (3), using u values between 0:13 and 0-19 (for eq. (4) one 
would need u = 0:06-0:26 to fit the data). 

For ploughing with a sphere the ratio of groove width to depth changes when 
the load is increased. Since the force ratio depends strongly on geometry, see 
e.g. eqs (1)-(4), one expects f to increase with increasing load. For cones and 
pyramids with rounded-off tips this is expected too. Experiments with rounded- 
off cones 13-18-19) and with flattened pyramids +°) show indeed that the force 
ratio increases with load, but a quantitative comparison with ploughing theory 
is difficult. Spheres can be made to a better precision and should therefore give 
more-useful results. The force ratio is found to be 7°) 


|. 4Baate 
— 1—(4/3x) op’ 


where sin « = b/2R, b being the width of the groove and R the radius of the 
sphere. If the penetration is small enough to neglect the terms with « in (5), 
one has f= u. Experiments with diamond spheres on steel and copper ?5) 
and on MnZn-ferrite crystals °t) agree quantitatively with eq. (4) and yield 
u values of 0-10 (steel ?9)), 0-12 (copper ?9)) and 0-11 (ferrite ?!)). 

From the preceding paragraphs it is clear that the ploughing model has been 
used extensively in the description of the deformation of metals. For ceramic 
materials, and brittle materials in general, much less information is available. 
Either the information is incomplete, so that e and f cannot both be determined, 
or the interpretation is difficult because of rounded-off points of the scratching 
particles. The scarcity of data on e makes a comparison with the Vickers hard- 
ness difficult 5:39). It was our aim to make a more complete investigation on 
ceramic materials. Moreover, since these materials are brittle, one expects the 
ploughing model to apply only to the “flow” properties of the material, but 
not to the cracking and chipping that is observed as well (figs 1 and 2), especially 
at higher loads. "Rn 


G) 


1.3. Chipping 


From experiments on rock drilling 3°) it is known that chipping leads to force 
fluctuations. In a study of the distribution of the amplitude and the wavelength 
of these fluctuations we found ?^) for various types of SrFe,,0,, a good cor- 
relation between the wavelength and the distance between grain boundaries or 
other obstacles. The question is whether such a correlation with flaw distance 


is also found on coarse-grained materials. Ogihara et al.?°) studied the amount 
of cracking and chipping around scratches on various MnZn and NiZn ferrites, 

both crystals and polycrystalline materials. The extent of cracking depends on 

the orientation of the crystallites and the direction of scratching. Recent 

theoretical work by Lawn et a1.35:39) has indicated how to incorporate the 

crack-extension force into a description of cracking around static indentations. 

Moreover, they distinguished between cracks which open up vertically (“median 

vents”) and those that lead to chips (“lateral vents”). Both types of cracks are 

observed during scratching, so one may hope that the brittle phenomena around 

scratches can soon be described more accurately than is possible now. 

A phenomenological description is used in the work reported here. Apart 
from the average scratching force, we measured its fluctuations. The dimensions 
of the groove and the extent of cracking and chipping were both measured and 
telated to the force data. Dry scratches were made on a variety of ceramic 
materials, widely different in hardness and microstructure. A number of 
pyramidal shapes with sharp points were used, e.g. three square pyramids with 
different apex angle, and a trapezoidal pyramid. A diamond sphere was also 
used. Experimental details are found in sec. 2. After the results in sec. 3, the 
ploughing theory is extended in sec. 4 to the specific energy where necessary, 
and to the trapezoidal geometry used here. After the discussions in sec. 5 the 
conclusions of this work are given in sec. 6. 


2. Experimental 


2.1. Scratching apparatus 


Scratches were made by moving a polished sample under a vertically loaded 
diamond. During the motion the horizontal force on the diamond was recorded. 
Afterwards the depth and width of the scratch were measured under a micro- 
scope. The apparatus, shown schematically in fig. 4, consists of a balance with the 
diamond underneath one arm and a sample holder driven by a motor. The 
other arm of the balance is connected to a counterweight (V) and a damper. 
The frame carries on a leaf spring S1 the arms A1 and A3. A lower arm A2 
hangs on a chain from A1 and is also connected to the central part of the frame 
by a second leaf spring S2. The springs S1 and S2 reduce hysteresis to a neg- 
ligible amount. Moreover they determine the vertical stiffness of the apparatus, 
which was measured at the load to be 250 N/m. The horizontal stiffness of the 
measuring part (2-4. 105 N/m) is determined by the bending element BE. As 
shown in fig. 5 this part consists of a rectangular bar with two slots. Across 
the upper slot eight semiconductor strain gauges SG *) are used for measuring 
the horizontal component of the force. The diamond holder fits into the lower 


*) Philips PR 9862. 
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Fig. 4. Schematic drawing of the scratching apparatus. Al, A3: upper arms, A2: lower arm; 
S1, S2: leaf springs; V: counter weight and damper; C, F: coarse and fine vertical adjust- 
ments; L: load; D: diamond; BE: bending element used for force measurements; S: sample; 
Mo: motor; E: x-y and rotation stages; Mi: micromanipulator. 


A2 


Scale 1:1 


Fig. 5. Bending element with strain gauges used for the measurement of the scratching forces. 


slot. A load L can be placed on top of arm A2, above the diamond D (fig. 4). 
Masses between 2 and 200 g were used. The strain gauges were connected to 
a carrier-frequency measuring bridge *), the output of which was recorded 
continuously. The response time of the system is 1 ms, that of the recorder 
0-1 s. The horizontal force, F,, was calibrated by means of a vertical load on 
a wire led over a wheel and connected to the diamond holder. The diamonds 
used had a diameter of about 4 mm. They were ground to the shape required. 
Three square pyramids were used with apex angles of 20 = 100°, 120° and 
136°. Most of the experiments were done by scratching with one leading plane 


*) Philips PR 9308. 
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asinfig. 3a; some measurements were done with two planes in contact, as shown 
in fig. 3b. A fourth diamond had a trapezoidal shape with 20 = 100? (fig. 3c); 
this shape was chosen to eliminate contact with the side planes, shown dashed 
in fig. 3a. The radii of curvature of the diamonds were determined under a 
scanning electron microscope. A good diamond has a tip radius of less than 
0-5 um. If a point was found to be bad due to chipping — this happened twice 
during the experiments to be described — the diamond was reground. The fifth 
diamond used had been given a spherical surface with a radius of 0-90 mm. 
All diamonds were mounted with an Au-Ta solder on a molybdenum holder. 

The sample motion is governed by a motor Mo *) connected to a micro- 
manipulator Mi and the sample holder S (fig. 4). The micromanipulator **) 
reduces the horizontal motion by a factor of 200, without hysteresis. On top 
of Mi x-y and rotating stages ***) were placed for sample movement. The 
diamond D can be displaced vertically; this is done by raising or lowering 
the frame by means of the coarse (C) and fine (F) adjustment. Since the appa- 
ratus is a sensitive seismometer, it is mounted on a damping table ****) (time 
constant 1 s). Perpendicular to the balance shown in fig. 4 a TV camera is 
mounted enabling to observe the diamond point and the sample surface on a 
monitor (750 x). 

The scratching experiments were made on the unetched surfaces; in a number 
of cases the samples were etched after scratching. The dimensions of the 
scratches were measured under the microscope; in some cases it was necessary 
to use a double-beam interference method for the depth measurement, but even 
then the accuracy of the depth values is not better than 0-2 um. Typical photo- 
graphs of the surface will be shown below. 

An experimental run consisted of first lowering the diamond until it nearly 
touched the surface; this could be observed on the monitor since the diamond 
and its image in the mirror of the polished sample surface were simultaneously 
visible. The second step was to load the diamond with the mass required. Then 
the motor was switched on for about two minutes. The sample speed used in 
these experiments was about 1 um/s, the total scratch length was 130 um. Each 
run was repeated two times on a neighbouring area of the sample. Average 
values to be used in the following were taken over the three scratches. - 


2.2. Samples 


The materials used in this investigation are various oxides. Although the 
primary aim was to use a number of ferrites with a variety of microstructures, 
ZnO, glass and AL,O, were added, thereby increasing the range of hardness. 
The ceramic materials were prepared by the usual techniques; table I gives the 


*) Eskap, Switzerland. 
**) Philips, Netherlands. 
***) Elmekanic, Amsterdam. 
****) Modern Optics, El Monte, Cal. 


TABLE I 


Data on the materials used 


preparation Vickers Meyer 


. material ee) ae temperature ae P P 53 ty hardness exponent figure 
CC) : "^ | (mm) n 


ZnO H.P. 
Pb glass **) cast 965 500 1:7 8 
Nio.36ZNo.64F e204 sintered 1250 950 1:9 6 
H.LP. 1280 800 1-9 2 
SrFe, 2019 aligned, 1250 650 1-9 Tb 
. sintered 
Al,O3 H.P. 1400 2300 17 9 


*) H.P. = hot-pressing; H.I.P. = hot isostatic pressing. 
**) Composition SiO, 349%, PbO 54-394, Na20 41%, K20 3-7%, Al2O3 2:5%, Sb203 0-5%, all % by weight. 
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preparation conditions. The surface of the plan-parallel samples were ground 
and subsequently polished with diamond powder in various stages. The finest 
polishing particles were in the range 0-2 um. The surface appeared to contain 
very small scratches only. On the polished surface two preliminary tests were 
made. The hardness was determined on a metal microscope *) fitted with a 
Vickers microhardness meter. Loads were between 0:4 and 1:0 N. From the 
load dependence of the hardness the value of n in the Meyer relation between 
load F and diagonal of indentation d, given by F — a d", could be determined. 
After etching, either thermally or chemically, the average grain size was 
measured. These data are also given in table I. 


3. Results 


Examples of scratches are shown in fig. 2 (Mn,.54Zno.35Fe2.0504); fig. 6 
(Nio.35Zno.c4Fe250,), fig. 7 (ZnO and SrFe,,0,.), fig. 8 (lead glass) and fig. 9 
(A120). In all cases a groove is seen, in some cases accompanied by cracks 
and chips. Two examples of plots of the tangential force F, during scratching 
under a given load F, are shown in fig. 10 for the MnZn-ferrite sample. For 
the 0-1-N load the F, time plot is fairly smooth and the average F, value, F, 
adequately describes the situation. A widely fluctuating curve is found for the 
higher load (0-5 N). As a measure of the fluctuations we shall use the maximum 
peak-to-peak amplitude measured on three scratches. The difference in ampli- 
tude corresponds to the appearance of the scratches in fig. 2. For the 0-1-N 
scratch a smooth groove is seen, while for 0-5 N cracks and chips occur, over 
distances much larger than the groove width. For the purpose of this paper the 
dimensions of the scratch are characterised, first, by the width and depth of 
the groove, and, second, by the maximum width of the chipped region. In the 
following the values of the average force will be given in sec. 3.1, the groove 
dimensions in 3.2, the specific energy needed for the creation of the groove 
in 3.3; the force fluctuations and the width of the chipped region are given 
in sec. 3.4. 


3.1. Average force level 


A plot of the experimental values of F,, i.e. the arithmetic mean value of F, 
taken over a scratch length of 400 um, versus the load applied is shown in fig. 11 
for three examples: ZnO, MnZn ferrite and AL,0,. F, is a linear function of 
F,, with a slope f that depends on the apex angle of the scratching diamond. 
For two values of 20, 100? and 136°, the F,/F, results for all materials investigat- 
ed have been plotted in fig. 12. A straight line can be drawn through the points 
irrespective of the material. This means that the overall behaviour is independ- 


*) Reichert, type ME - F. 


c) 
Fig. 6. (a) Microstructure of the Nig.36ZNo.64Fe2O4 sample. Average grain size about 
30 um. (b) A 100-g scratch (20 = 136°) before (right) and after (left) etching. Some of the 


cracks are curved and run through the crystallites. Other cracks clearly run along the grain 
boundaries. 


ent of material properties such as hardness and microstructure, in agreement 
with the assumptions of the ploughing model. 

A more quantitative comparison with theory can be made by considering the 
experimental values of the force ratio f = F,/F,, given in figs 13 and 14 and 
in table II. As stated in the introduction a linear relation is expected between f 
and cot 0. In fig. 13 the f-cot @ results for all materials are given for three 
values of the normal load. Although there is some scatter, it appears that the 
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Fig. 7. (a) A 1:0 N scratch (20 = 120°) on the ZnO sample. Average grain size is about 1 um. 
Porosity: 8%. 
(b) Microstructure of SrFe;,0;9 sample. Average grain size is 5 um. Porosity 8%. 


Fig. 8. Scratches on a sample of lead glass (20 — 136^). The direction of scratching is up- 
ward. On the left: load 0-1 N; on the right: 1:0 N. 
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Fig. 9. Microstructure of the Al;O; sample. Examples of scratches (20 = 100°). The loads 
are 0-1 N (left) and 1:0 N (right). 


expected linearity roughly applies. Moreover, the majority of the points lie just 
below the lines drawn with a slope of unity. This means that the quantitative 
predictions of the ploughing model are correct only if one replaces the factor 4 
in eq. (3) by, say, 0:9. Measurements made with the trapezoidal-shaped diamond 
are given in table II, together with the results for the square pyramid with the 
same apex angle (20 = 100°). Within the limits of reproducibility given for 
each value, there is no systematic difference between the two pyramids, with 
the exception of ZnO. 
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——- t(s). 


Fig. 10. Recording of horizontal force component, F,, for a load of 0-1 N (bottom) and 
0:5 N (top) on MnZn ferrite. The microscopic photos of these scratches were shown in fig. 2. 


ZnO Mnps7Zng3sFesgg0;  . Al203 
10 
= 00° : 
F+ (N) o 00° 
120 120° 
Os D o 
136 136° 
! 


0 0-5 m 0 05 10 0. 0-5 10 
—— FníN) 


Fig. 11. Average scratching force, F,, for various materials as a function of the load on the 
diamond, F,. The apex angle of the diamond pyramid is indicated in the figure. 


F; (N) 20 = 100° à f= 0% 


0 0:5 1-0 
— Fr (N) 


Fig. 12. Average force versus load for two pyramids. All materials investigated are included. 
€ ZnO, O Al203, x SrFe;20,5, O Pb glass, A (Ni,Zn) ferrite, + (Mn,Zn) ferrite. 
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10 10 


05 05 


0 
0 0:5 10 0 05 10 


—-» cot 8 


Fig. 13. Force ratio f = F,/F, as a function of cot 0 for various loads on the diamond. All 
materials are included (symbols as in fig. 12). 


1-0 28 = 100° 
cot 6=0-84 


ae ZR 
x 


0 
001 002 005 01 02 05 1 
— F, (N) 


Fig. 14. Force ratio f = F,/F, as a function of load for SrFe;20;9 and Al203. The apex 
angle is 100° (cot @ = 0-84). A square and a trapezoidal pyramid have been used. x Al,03, 
square; O Al,O3, trapezoid; +- SrFe;20,s, square: A SrFe;2019, trapezoid. 


For comparison with eq. (2) a few measurements were made with the 
pyramidal diamonds in the 45° position shown in fig. 3b. On NiZn ferrite we 
measured for 26 = 136° the value of f = 0-40 and for 20 = 120°, f= 0:55 
(cf. 0-28 and 0-41 respectively from eq. (2) for u = 0). These values will be 
discussed in sec. 4.2. 

The data given up till now give support to the ploughing model. If, however, 
one considers the f values of Al,O; an unexpected dependence of f on the 
load is found. Figure 14 compares the data for Al O, and SrFe,20,, for the 
two indenters with 20 = 100°. As can be seen, Al,O3 shows striking behaviour 
at low loads. This gradual increase of f with F, is not observed in SrFe;20,5,, 
nor in the other materials investigated in the range of loads used here. Since 
the depth of the groove in Al;O; is less than a micron (which is smaller by a 
factor 1-5-2 than in the other materials), one has to consider the finite sharp- 
ness of the diamonds. As mentioned in sec. 2.1, the radius of the diamond tip 
is 0-5 ym, so for shallow indentations the indenter is not a pyramid, but an 
(approximate) sphere. Such rounding-off is equivalent to a higher 0 value and 
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TABLE II 


Experimental values of force ratio f = F,/F, for the square and trapezoidal 
pyramids (both have 20 = 100°, cot Ó = 0-84) and the spherical diamond. 
The force ratios of the latter are closest to the Coulomb friction coefficient yw. 
A Coulomb friction coefficient for the trapezoidal diamond, u (trap.), has been 
calculated from fitting the f value to eq. (7). The uncertainty given indicates 
the range of experimental values from three runs 


force ratio f 


friction 


material 
y (trap.) 


square 


ZnO 
Pb glass 
Nig.36ZNo.64F e204 
Mno.57Z0o.35Fe2.9304 
SrFe,5,0,5 

Al;0; *) 


*) For loads of 0-1, 0-2 and 0-5 N. 


thus, in the ploughing model, a lower f value is expected. Of course, at higher 
loads, the tip is less important and the pyramidal values should apply. The 
experimental values of the groove depth will be given in sec. 3.3. 

Finally, to conclude this section on the average force values, results will be 
given for the diamond hemisphere. For loads up to 1 N the force ratio increases 
somewhat with load and is much smaller than for the pyramids. The f data for 
F, = 0-1 N have been added to table II. 


3.2. Groove dimensions 


In contrast to the pyramid indenters the hemisphere caused no visible de- 
formation of the material unless a load of several tens of Newtons was used. 
For the pyramid indenters we have to distinguish between well-defined grooves 
where both width b and depth d can be measured, and rough grooves where 
extensive chipping, especially if it occurs inside the groove, renders a d measure- 
ment impossible. For Al;O4 and lead glass, for instance, only d values are 
available for the lower loads. The width b, however, can always be determined, 
since even heavy chipping leaves parts of the groove edge visible. 

The experimental b and d values are plotted as a function of the load F, in 
: fig. 15 for three examples, ZnO, SrFe,,0,. and Mno.57Zno.35Fe2.9304. One 
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50 ZnO SrFej30jg Mng.52 Zh g.35F 2.950; 
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Fig. 15. Width b and depth d of the grooves as a function of load F, for three materials. 
The apex angle 20 is indicated for the d values. The following symbols have been used: 

b: @, 100°; O, 120°; O, 136°; 

d: A, 100°; x, 120°; +, 136°. 

The drawn lines have a slope of 0:5. The dashed lines have been calculated from the b values 
by using d = 0-5 b cot 8. 


Fig. 16. Depth d versus width b for all samples. The apex angle 20 has been indicated. The 
drawn lines have a slope of 0:5 cot 0; the dashed lines are least-square fits to the data. 


notices that the data for b seem independent of the apex angle 0, in contrast 
to the d values. If the groove is a good copy of the indenter one expects that 
d = 0-5 b cot 0, as long as rounding-off of the tip can be neglected. A d versus b 
plot is shown in fig. 16, where the straight lines through the origin have a slope 
of 0-5 cot 0. The dashed lines are least-squares fits of the data to a linear rela- 
tion. In view of the finite sharpness of the tips (radius 0:5 um), the d values 


b 
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below 0:5 um should be handled with care. The higher d values show a fair 
agreement with the geometrical model, so, where data are missing, d was cal- 
culated from b, which is more readily measurable. The dashed lines in fig. 15 
have accordingly been based on the experimental b values. Again, in view of 
the accuracy of the data, the agreement is reasonable. 


3.3. Specific energy 


The energy needed for the removal of a unit volume of the groove can be 
calculated from the average tangential force, F,, and the cross-section of the - 
groove, 4,, by e = F,/A,. Unless the groove depth is smaller than the radius 
of the indenter tip, A, is given by bd/2 and can be calculated from the data 
given above. Figures 17 to 20 give plots of the specific energy as a function 
of load, Vickers hardness and groove depth. For 20 — 120? the e/F, plot in 
fig. 17 shows a stronger decrease of e with increasing F, for the harder materials. 
It is instructive to compare the specific energy with the Vickers hardness (table I). 
Figure 18, now presenting data for all 0, shows that e increases with hardness. 
The increase is more pronounced for lower loads. This figure also shows a large 
spread in e for Al,O3. The same scatter is visible in fig. 19, where e/F, data 
on Al;O, and ZnO are shown for three Ó values. 

For ZnO, a soft material (H, = 260 kg/mm?), the e values are independent 
of load and apex angle within the relatively small scatter of the data. For Al,O3 
e shows a marked decrease when F, is lowered. This part of the plot strongly 
depends on the diamond used. As before (cf. fig. 14 in sec. 3.1) the roundness 
of the diamond tip leads to systematic errors for low loads. Since the cross- 
section is no longer a triangle, A, is smaller than bd/2 and e is correspondingly 
larger. Since the tip shape is not accurately known, the data could not be 
corrected. 


S 
e 


28 =120° 


~ 
Q 
Q 


an 
Q 


Ferrites x p 
‘A. 


Pb glass NEL LN 
lnc! A 
ZnO — üÓ 21 


N 
e 


- 
Es] 


o 


— Specific energy e (J/mm%) 


5 : 
00! 002 005 04 02 0:5 1 
—— F, (N) 


Fig. 17. Specific energy for scratching, e, versus load, F,, for a Biven pyramid and a number 
of materials (symbols as in fig. 12). 
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Fig. 18. Specific energy, e, versus Vickers hardness, Hy, for loads of 0-1 N and 1:0 N; all 
0 values have been included. 
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Fig. 19. Specific energy, e, versus load, F,, for AlzO3 and ZnO for three values of the apex 
angle 20 of the pyramidal diamond indenter. 


In presentations of grinding data, frequently the specific energy is plotted as 
a function of the average depth of cut ?:**9). A similar plot for scratching is 
given in figs 20a and b, where the depth of cut has been replaced by the groove 
depth d (where no experimental d value was available, b was used to obtain 
d from d = 0-5 b cot @, cf. sec. 3.2). Figure 20a shows the data for the “white” 
ceramics; as before, there is a more pronounced dependence on d for the harder 
materials. (Though not allowed, some e values from grinding various steels, 
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Fig. 20. Specific energy, e, versus depth of scratch, d, for all apex angles. For clarity of pres- 
entation the white ceramics are shown in (a) separate from the black ceramics (b). The values 
labelled steel (w) (a) have been taken from grinding results of Brecker and Shaw (ref. 6). The 
black squares for A150; refer to measured d values, the open squares have d calculated from b. 


taken from ref. 6, have been added to this plot. It is surprising that the grinding 
results on these metals are of the same order of magnitude as the data from 
low-speed scratching on Al;O,.) Figure 20b shows the data for the ferrites; 
no systematic variation with apex angle or microstructure or composition is 
found. 


3.4. Force fluctuations and chipping 


As a measure of the force fluctuations will be used here AF,, the value of 
the maximum peak-to-peak amplitude of. the F,-time recordings, taken over 
three scratches. An example of this definition was shown in fig. 10. Similarly, 
the extent of chipping will be measured by b., the maximum width of the chipped 
region on the surface; again the maximum value is taken over three scratches. 
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In thís way, all visible damage is included, even when the chips are still par- 
tially connected to the material. It does not lead, however, to a measure of the 
depth of chipping, so it is not possible to calculate the volume removed by 
brittle chipping, therefore no specific-energy values will be available for this 
part of the scratching process. 

The AF, values thus obtained increase with increasing load F, and with 
decreasing apex angle 8. Instead of presenting the A4F,/F, plot for the three 
values of 0 and the AF,/cot 0 plot for several values of F,, we give some 
examples of AF, versus F, plots in figs 21a and b. In these graphs the data are 
no longer on separate curves for each 0 or F, value but are found on a single 
line. Down to the lowest loads a linear relation is found, as shown in fig. 21b 


Pb glass Nig.35 209.64 Fe20;, 


0 0:5 i 0 Os 1 


(lower scale) 


0-00! 0-01 04 1 


Fig. 21. Maximum amplitude of force fluctuation, AF;, versus average scratching force, 
F, for various materials. For SrFe; 50,9 and Al2O3 a double logarithmic plot has been used 
to show that the relation extends over two orders of magnitude. @ 100°, x 120°, A 136°. 
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for AL,O; and SrFe,;0,, in a double logarithmic plot. Two aspects of these 
results should be stressed. First, there is apparently no sign of a threshold for 
fluctuations to appear in the load range considered; second, the AF,/F, relation 
seems to be the same for all materials investigated. 

The results for 5,, the width of the region where brittle chipping and cracking 
are found, have been plotted against F, in fig. 22 for two materials, SrFe,.0,9 
and Mn,.5;Zno.55Fe;.9404. For the hexagonal ferrite b, is found to depend 
on tangential force as F,*. This behaviour was also found in ZnO and 
Nio.36Zno.c4Fe;O4. For the MnZn ferrite the b,/F, relation is linear (fig. 22), 
which was also found for lead glass and for the Al;0, sample. In view of the 
linear relation between F, and AF,, the b,/AF, plots are similar to fig. 22 and 
are not given separately. 

High-speed single-diamond-scratching experiments ^71?) yield information 
about the maximum ~, value that occurs in a scratch. In a similar way the 
maximum F, values found in our low-speed experiments were used for a plot 
of the maximum force ratio finas = F; max/F, versus groove depth d as shown 


100 


€ 
e 


N 
ce 


~ 
QO 


-N 


——» Width of disturbed region be (um) 
~ or 


Q 
à 


28 100? 120? 136? 
SrFej O19 
02 Mng.57 Zhg.35F 2.09% 


Übor o02 005 01 02 085 1" 


Fig. 22. Width of chipped region outside groove versus average tangential force, F, for fine- 


grained SrFe,204o9 (left-hand scale) and coarse-grained Mno.57Zng.35Fe2.9304 (right-hand 
scale). The apex angle 28 is indicated. 
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in fig. 23. fmax is constant up to a depth of 10 um. Higher loads are necessary 
to cover the ranges of depth used by Prins!?) (8-72 um) and Gielisse 9) 
(25-75 um). 


0 5 10 
——» d (um) 


Fig. 23. Maximum force ratio, fmax, versus depth of scratch, d, for all materials investigated. 
The apex angles are indicated. 


4. Ploughing theory 


Before the experimental results can be compared with theory, the ploughing 
model must be extended to include the specific energy, which is usually not 
given explicitly. In this model one needs the average stresses perpendicular to 
and in the interface, the compression (o) and shear (Tt) stresses respectively. 
The shear or friction stress can be described by v = wo in which yu is the 
Coulomb friction coefficient. The direction of the friction stress at the inter- 
face is determined by the direction of relative motion of the material and the 
interface. Two cases can be distinguished: first, the indenter moves over the 
material without appreciable flow, as in pure Coulomb friction; second, the 
material that is ploughed out moves over the indenter both upwards and side- 
wards. For the trapezoidal pyramid (fig. 3c) there is one contact plane presum- 
ably with upward flow only. Here the friction force is upward if the material 
flow is less than the indenter displacement, and downward in the opposite case. 
More ambiguous is the situation for two leading planes (fig. 3b) and the results 
depend strongly on the flow direction that is assumed. 


4.1. Pyramid with one plane in contact 


The case of one plane of contact is shown in fig. 24. The measured tangential 
force, F,, and the vertical load, F,, can be combined to give the total force, 
Fo. Decomposing Feot into a force perpendicular to the contact area, F,, and 
a parallel component, F, we have introduced the two main physical processes 
occurring: first, compression by F, leading to failure of material in front of 


- 
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Fig. 24. Diamond geometry showing the contact area AB. The load, F,, and the measured 
tangential force, F,, are combined into the total force F,,,. The components perpendicular 
(F,) and parallel (Fj) to the contact area can easily be deduced. The force ratio f equals 
tan f. The friction coefficient is u = tan a. 


the leading plane of the diamond, and second, Coulomb friction between the 
material and the diamond by Fi. If the friction coefficient is u, one has 


p = tana = FyJF,, (5) 


thereby defining a. 
Introducing the angle B between F, and Fio one has (fig. 24) 


a+ B+ 4 = 90° (6) 


from which 


cot ü— u 
f= FF, = tan B = cot (0 + «) = —————. (7) 
1+ u cot 0 


In fig. 24 conditions have been chosen such that f is smaller than cot 0, i.e. 
a has been taken positive, in qualitative agreement with most of our obser- 
vations for the square pyramids (fig. 13) and all of our results for the trapezoidal 
diamond (table III). A positive « means that the friction force on the material 
points downwards (fig. 24), i.e. the material that has been removed is moving 
upwards with respect to the diamond. This is actually observed during our 
scratching experiments on these materials. In our experiments with a spherical 
diamond, however, very little if any material is “pushed” out and the friction 
force on the material (now equal to F;) is in the direction of the diamond dis- 
placement. It is also possible to increase the load on the sphere to such an 
extent that material is removed. This situation is considered in Shaw’s theory 
of grinding °°), where a chip is created in front of the sphere. Here again, one 
would expect the friction force on the material to be directed downwards 
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against the material displacement. Shaw used the opposite direction for the 
friction, which has far-reaching consequences for the specific energy, as will 
be shown below. 

The specific energy e is defined by the work done to remove unit volume of 
material, so one has 


e = Fil, (8) 


where A, = bd/2 for a pyramid of width b and depth of cut, d. By analogy 
to e one may define the scratch hardness H, by 


H, = F,lAns (9) 


where A, = b?/4 is the projection of the force-bearing surface on the plane 
perpendicular to F, 

Equations (8) and (9) can be expressed in terms of F, and F, by the fol- 
lowing transformations: 


F, = F, cos 0 — F sin 6, 


F, = F, sin 0 + Fy, cos 0. (10) 
Moreover A, and A, are related to the contact area A, (AB in fig. 24) by 
A, = A, cos 6, 
A, = A, sin 8. (11) 
Defining now o and t (equal to s in Goddard's notation ?5)) by 
o =F, /A,, 
t= FA, an 
and using the friction coefficient u = F/F, = t/o one has from (8)-(12) 
e =c (1 — u tan 0), (13) 
H, = o (1 + uw cot 0). (14) 


For the 9 values used here and the experimental p values given in table IT one 
sees that the influence of the second term in these equations amounts to less 
than 15%. This agrees with the absence of a 0 dependence for the specific 
energy (figs 17, 19, 20) and the scratch hardness that can be deduced from the 
b values shown in fig. 15. This also means that no distinction between e and 
H, needs to be made here. In experiments with larger apex angles, however, the 
0 dependence may be much more pronounced. For scratching metals Brookes 
et al.??) used conical indenters with a maximum 20 value of 170°. With a 
modification of eq. (1), 
2 p 
f ee BPm 


a H, 


cot 0, 
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where pm is the dynamic flow pressure and H, the scratching hardness, they 
found that p,//H, starts deviating from unity above 20 = 150°. This might 
be related to friction effects. 

For a spherical indenter, as considered by Shaw ?9), 0 is nearly 90? and so 
e should depend strongly on u and 8. This is seen in Shaw’s formula (eq. 26 
of ref. 30) 


e = (32/2) « [1 + (u/2) (2R/d)*1/B, (15) 


where 2R is the diameter of the spherical indenter and f) is the upward-flow 
factor, which corrects for the fraction of the displaced volume that rises above 
the original surface and is still connected to it (for ceramics this may not be 
so relevant so we put f = 1). The factor 37/2 differs from unity (cf. eq. (13)), 
because, first, the load is supported by both the material and the chip, so the 
area is larger by a factor 2; second, the force ratio is f= a instead of 
f = (4/3 x) « which is found after averaging over the spherical segment in 
contact (cf. eq. (5)). The second factor in the square brackets can be replaced 
by p/(x/2 — 0), so one has 


e = (37/2) c [1 + w/(@/2 — 0)) (16) 


‘in agreement with the 0 dependence of eq. (13). For 2R/d = 100, as in Shaw’s 
example, the second term is Su, so the friction does become important. It has 
been pointed out already that the sign of the friction in (15) and (16) is dif- 
ferent from that found in the experiments reported here on ceramic materials. 


4.2. Pyramid with more contact planes 


Up till now only one of the pyramidal planes was assumed to contact the 
material, as in the case of the trapezoidal pyramid shown in fig. 3c. The more 
general case of fig. 3a can be treated by considering the side planes as well. 
Hisakado ?9) gave a derivation for the force ratio when the two side planes 
are in contact. The load, F,, is now distributed equally over three planes. The 
result is eq. (3), 


cot 0 + u (2 cosec 0 + 1) 
fim (17) 
3— u cot 0 


The specific energy also follows readily from ref. 26 and is given by 
e — a [1 + u (tan 0 + 2 cosec 0)]. (18) 


Goddard and Wilman ?5) calculated f and e for the generalized case, where 
the direction of scratching makes an arbitrary angle with the projection of 
the intersecting line of the planes in contact. This treatment includes the 
situation in fig. 3b, and, as a limiting case, that of fig. 3a but now one side 
plane carries half of the load. This situation is more likely to occur than that 
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where three planes are in contact. From ref. 25 one obtains, in our definition 
of 6, 


f — X (cot 0 + u cosec 6), (19) 


where the factor $ originates from the number of planes in contact (cf. Hisaka- 
do's denominator in eq. (17) for three planes). The treatment given by Goddard 
and Wilman starts from the assumption that the flow of the material is in the 
horizontal plane, so the scratch hardness is independent of u. Since material is 
removed upwards over the contact plane, this cannot be correct. 

The appendix treats the case where the friction force is parallel to the direction 
of relative motion of the indenter over the material; the flow of the material 
over the contact plane may still be chosen arbitrarily. Assuming pure Coulomb 
friction without flow, we obtain the following results. For the situation in 
fig. 3a with one side plane in contact, eqs (A.11) and (A.12) give 


f cot 0 -+ u (cosec 6 + 1) 


20 
2— u cot 0 o 
to be compared to eq. (19), and 
1+yu(sin@+1 
n RADO (21) 
cos 0 
For the symmetrical situation of fig. 3b the Goddard result 25) is 
cot 0 + u cosec 0 
= ——____—__,, 22 
f yi (22) 
e — o (1 + u sec 6), (23) 
where the appendix gives ((A.15) and (A.16)): 
cos 0 + u (1 + sin? 6)* 
fus ee, Q4) 
2* sin 0 [1 — u cos 0 (1 + sin? 60)7 5] 
e — o [I -+ u (1 + sin? 0)*[cos 0)]. (25) 


Comparing the Goddard results in eqs (19), (22) and (23) with those derived 
here, (eqs (20), (2D), (24) and (25)), one notices the stronger influence of the 
Coulomb friction in the present results. It should be remarked that the for- 
mulae (17) to (25) indicate that fand e increase with increasing friction, whereas 
(7) and (13) show a decrease because the opposite direction of Fi, had been 
assumed. The problem of the material transport over the interface is also found 
in the theory of hardness indentations. However, more experimental informa- 
tion is obtained from scratching where one measures two force components 
instead of one. 
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5. Discussion 


The experimental results will now be compared with theory. The ploughing 
model predicts the values of the average force ratio and the specific energy once 
the indenter geometry and the interface friction are known. The present section 
discusses the number of contact planes of a pyramid and the friction one needs 
for agreement with theory. Finally some remarks will be made about the fluc- 
tuations in the scratching force and the extent of chipping and cracking. 


5.1. Force ratio ` 


A quantitative comparison with theory needs the value for the Coulomb 
friction coefficient, u. The experiments with the spherical diamond yield f values 
given in table IL. Since there is no visible material removal at these low loads, 
one may consider these values as representative of the Coulomb friction. In the 
following these values will be compared with the u necessary for fitting the 
various theoretical expressions to the experimental data. It is realized, however, 
that friction with a ball over a contaminated surface and in the pyramidal plane 
over a freshly created surface may differ. 

First consider the geometry shown in fig. 3a. For Hisakado's result in eq. (17) 
one needs a rather high u value to compensate for the large denominator. 
Taking from table II an experimental value of f = 0-8, one needs u = 0:36 
for agreement with eq. (17). Such a high value of yw is in contrast to Hisakado’s 
experimental results for scratching on metals (v, = 1-100 cm/s, steel on steel, 
Steel on copper); his f values do agree with the factor 4 in eqs (3) and (17), 
and a positive u value of 0-14. The sign of u in the present work was appar- 
ently negative (as drawn in fig. 24). 

Goddard and Wilman's formula, eq. (19), can be fitted to the experiments 
by e.g. p = 0:28. The treatment as presented in sec. 4.2 (eq. (20)), can be fitted 
by u values in the range 0-19 to 0-29 with the exception of Al,O3 where the 
lower f value leads to u = 0-13. 

For the geometry shown in fig. 3b the experimental f values are given in table III, 
together with the u values necessary for fitting the data to eqs (22) and (24). 

Turning now to the experimental results of the trapezoidal diamond (fig. 3c), 
we have used eq. (7) to calculate y for fitting the data. These values have been 
added to table II. Using the same expression for the data on the square pyra- 
mids the u values are about the same as for the trapezoidal diamond, with the 
exception of ZnO (cf. table IT). 

From the results on y for the square pyramid with one leading plane (fig. 3a) 
we conclude that a value of at least 0:2 is necessary for fitting the f data to 
the theoretical expression if one assumes that there is more than one contact 
plane. For the trapezoidal diamond one would need much lower u values. To 
reconcile these data, it seems simpler that in both experiments there is only one 
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` TABLE III 


Data on the force ratio f for scratching Nig.3gZNo.g4Fe.O, with two contact 
planes, as shown in fig. 3b. The friction coefficients x, necessary to fit the data 
to eqs (22) and (24), are given as (22) and u(24). For u — 0 one would 
expect f to be equal to cot 0[y2. The f value determined by the diamond 
sphere has been taken from table II 


120° | 0:54 + 0-07 0-16 + 0:08 | 0-10 + 0-05 
0-41 + 0-03 0-16 + 0-04 | 0-10 + 0-02 


= hese 


contact plane. The friction on this plane is somewhat higher than follows from 
the experiments with the diamond sphere (table II). A possible contribution 
from the friction on the side planes may differ for each indenter-material com- 
bination, so it would be of interest to extend the present work to low-speed 
scratching of diamond on metals. 

From the small number of results on the case with two leading planes (fig. 3b) 
it is concluded too that the calculated friction is larger than expected from the 
sphere data (table IIT). 

An exception to this conclusion seems to be Al,O3, where low f values, 
increasing with load, were found for both square and trapezoidal pyramids 
(fig. 14). This does not disprove the model, because the geometry of the point 
is different from the rest of the pyramid and this will especially be felt at the 
very small depths of cut, d, that are found in very hard materials. Using flat- 
tened pyramids Prins +°) observed a similar increase of force ratio, although f 
became too high at larger depths of cut to agree with the predicted cot 0 
value. For spheres this increase of f with load has been found at large depth 
of cut by Lal and Shaw ??) on metals. 


f (sphere) 


0-043 + 0-005 


5.2. Specific energy 


The second term in eq. (13) can be neglected for the present range of @ values, 
so the specific energy gives directly the compressive stress on the contact plane, 
c. Specific energy, scratch hardness and indentation hardness yield data on the 
same average quantity, o; the only difference is the stress distribution in the 
material outside the contact planes. The correlation between e (and thus c) 
and the Vickers hardness that was shown in fig. 19 is therefore not surprising. 
From fig. 19 it is seen that tbe relation between e and H, is stronger at low 
F, values than at F, — 1 N. This effect is well known from grinding experi- 
ments on metals ?) (data on rocks +7) show the same trend, but we do not 
want to consider brittle chipping here). 
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For comparison the values for various steels ) have been added to the plot 
of e versus depth of cut, d, in fig. 20a. One explanation of the observed decrease 
has been given by Shaw 39) in terms of the effect of friction on spherical abrasive 
particles. As eq. (15) shows, the specific energy depends on d through the second 
term that contains u/ yd. According to the discussion given earlier, the sign 
of the friction force might well be the opposite. Moreover, this explanation 
would not hold for pyramidal indenters since eq. (13) does not depend on d, 
So a more general argument must be found. In view of the similarity between 
specific energy and hardness, one is reminded of the variation of hardness with 
load, usually expressed by the parameter n in Meyer's relation between load 
F, and indentation size d: F, — a d" or, in terms of hardness, 


Hy, = F,[d? = a d'=? = gin gi-21m, (25) 


Since n is smaller than 2 this relation describes the decrease in hardness when 
d and F, increase. 

From the e/F, plot in fig. 18 one may estimate n assuming that (24) holds 
for e. Values of 1-3 (A1203), 1:4 (ferrites) and 2 (ZnO) are found, which differ 
considerably more from 2 than the values from hardness given in table I. This 
lack of agreement might disappear in a detailed model for the constraint factor. 
Recent reviews of the possible causes of the load dependence of H, for met- 
als 38749) stress the importance of local deformations near grain boundaries 
and the surface when the load is increased. For brittle materials another ex- 
planation is possible: the cracking outside the indentation or the scratch reduces 
the elastic deformation of the “hinterland” ?9:?2), thereby lowering the con- 
straints on the plastic deformation. If this were true a more pronounced H,(d) 
relation would be expected in materials where crack nucleation and propagation 
are easier, Le. where the crack extension force is relatively low. It would be 
useful to compare the fracture toughness of the materials investigated. Per- 
haps it is necessary to stress again that our specific-energy values apply to the 
groove only; since the volume removed by chipping around the groove is not 
included, a direct comparison with grinding data is not allowed. 


5.3. Fluctuations and chipping 


We now turn to the phenomena related to the brittleness of these materials. 
Force fluctuations have been observed and the extreme values, AF,, have been 
compared to the average tangential force component, F,. As shown in fig. 21, 
AF, is a simple linear function of F,, without a pronounced dependence on 
angle 0 and load F,. Even more unexpected is the finding that the relation holds 
for all materials; moreover, there is no threshold load above which the force 
starts fluctuating. It is well known in rock drilling °°) that chipping leads to 
force fluctuations, so one has to explain, first, that chips and cracks can occur 
at any load (in the range used here) and, second, that cracking leads to an 
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increase in force (and a subsequent decrease) that is proportional to F, the 
force necessary for making the groove. On the first problem, one may remark 
that, if the stresses around pyramidal indenters are sufficiently large to nucleate 
a crack, this will certainly happen somewhere along the scratch. The reason is 
that a moving indenter always encounters defects in the material that can 
develop into a crack. For a static indenter, as used in a hardness experiment, 
this is not so and cracking should be more a random process. (This problem 
was considered for ring cracking under a loaded sphere by Oh and Finnie **).) 
The stresses around an indenter decrease from approximately the value o at 
the interface to zero at large distances. The experimental values of o (= e), 
shown in figs 20a and b, are 5. 10? to 5.101? N/m? (500 to 5000 kg/mm?), 
which is presumably sufficiently high for cracking. The second problem is to 
explain a proportional increase in tangential force. Crack growth, as observed 
here, requires an increase in force, because the situation is similar to that of 
ring cracking around a spherical indenter ?5:*?). It is by no means clear, how- 
ever, that this increase should be the same fraction of F, for all materials, 
especially if the differences in chipping and cracking are taken into account. 
As shown in fig. 22 two types of behaviour have been found. In the first type 
the extent of chipping, b., depends on the average tangential force as VF; 
this is found for ZnO, Nio.36Zno.64F€204 and SrFe,.0,9. For the second 
type b, is linear in F, (Al2O3, Mno.57ZMo.3sFe2.9g04 and lead glass). In each 
group of materials various microstructures are found: ZnO and SrFe,;;0,, 
(fig. 7) are porous, fine-grained materials where cracking occurs along grain 
boundaries and the chips consist of whole grains. Nig.35Zno.;4Fe;O, has larger 
grains, a low porosity, and cracking is partially along grain boundaries. Figure 6 
shows that loosened crystals have disappeared after etching, so here too the 
chips can be whole grains. As in glass (fig. 8) one finds in Mno.57Zo.35Fe2.9304 
(fig. 2) that cracks propagate through the (large) crystallites; when a crack 
crosses a grain boundary, one seldom finds propagation along the boundary. 
It is remarkable that the nearly pore-free, fine-grained A1;0, is found in the 
second group. Cracking occurs over distances much larger than the grain size 
(fig. 9) but in and around the groove very fine particles are found. It would 
be worthwhile to study scratching for a material where the grain-boundary 
properties can be varied. In combination with experiments on the crack ex- 
tension force and a model for the stresses around a pyramid, one may come 
to a better understanding of the cracking around a groove. 


6. Conclusions 


6.1. Ploughing model 


This model predicts the force ratio, the specific energy and the scratching 
hardness for homogeneous materials, once the indenter geometry and the inter- 
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face stresses are known. Special attention has been paid here to a pyramidal 
indenter with one plane in contact. Moreover, Goddard and Wilman's result 25) 
for two contact planes has been modified for a more general treatment of the 
shearing force at the interface. The crucial assumption here is the direction 
of this force. 


6.2. Force ratiof . 


For a diamond hemisphere the force ratio was found to be low (0-04-0-10). 
This type of scratching leaves no deformation on the surface ("rubbing"), so 
the f value may be used as a measure of the Coulomb friction coefficient, B. 

For sharp-pointed pyramids the f values are much higher (0-4—0-9) and are 
independent of load, hardness or microstructure of the scratched material. In 
the load range used here, f is approximately given by the “ploughing” relation 
f = cot 0, where 20 is the apex angle of the pyramid. 

There is no significant difference between the f data for a square pyramid 
and a trapezoidal pyramid. This strongly suggests that the number of contact 
planes for a square pyramid is equal to unity too. For the pyramids with one 
leading plane f is smaller than cot 0, so the friction force on the material is 
opposite to the diamond movement. The y value is roughly equal to the f value 
measured with the spherical diamond (table II). For the pyramids with two 
leading planes f is larger than predicted (table III), so the direction of the 
friction force is opposite to that for the single contact plane. 

A1;0; gave much lower f values than the other materials. This may be caused 
by the roundness of the tip of the pyramids. 


6.3. Specific energy e 


The value of e does not depend on 6 within the experimental accuracy, but 
it does depend on the material. There is a good correlation with the Vickers . 
hardness. The e-H, relation, however, depends also on the load on the diamond, 
or, as it is usually expressed in the literature on grinding, the specific energy 
depends on the depth of cut, d. The e/d dependence is strongest for the hardest 
material tested, Al;0; (H, = 2300 kg/mm?) and practically absent for ZnO 
(H, = 260 kg/mm?). The ferrites investigated (H, = 600-900 kg/mm?) show 
an intermediate behaviour. For pyramidal indenters the e/d dependence can- 
not be explained by Shaw’s theory for spherical diamonds °°) where e decreases 
with increasing d because friction is more important on a larger curved contact 
area. The e/d relation seems to be more general and is possibly related to the 
amount of cracking around the scratch, which is also more pronounced in the 
harder materials. 


6.4. Fluctuations 


Down to the lowest loads used, fluctuations in the tangential force F, have 
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been found. The largest amplitude is a linear function of the average value 
of F,, which function is practically independent of the material scratched. The 
width of the disturbed region, however, does depend on the material, especially 
its microstructure. Given a workable theory for chipping on this scale, the 
scratching method could well be used for characterizing materials, both for 
strength and machinability. 


6.5. Suggestions for further work 


The work on the force ratio could be extended to the scratching of metals, 
in order to see if there are differences in the interface friction and in the specific- 
energy/depth-of-cut curve. For brittle materials more data on the fracture 
toughness are necessary, which can then be correlated with the specific-ener- 
gy/depth curve and the force fluctuations. 


Acknowledgements 


The authors are indebted to various colleagues at the Philips Research 
Laboratories for supplying materials, to Mr P. F. Broekxman for polishing 
the samples, to Mr Th. Zeegers for grinding the diamond pyramids and the 
hemisphere, to Mr A. H. C. Vliegen for performing the hardness measurements, 
to Mr P. J. Rankin for setting up this experiment during his stay at our labora- 
tory, and to him, Messrs R. J. Klein Wassink and P. J. Gielisse for stimulating 
discussions. 


Eindhoven, October 1975 


Appendix 


The influence of the friction in the contact planes is considered here for the 
case of a regular square pyramid moving in an arbitrary direction. Let the dis- 
placement of the pyramid be u (fig. A.1). For each of the contact planes u can 
be resolved into one component perpendicular (u3) and two components parallel 
Uy, Uz, Vector sum u to the plane. We assume that a normal stress ø on the 
contact plane is necessary and sufficient for ploughing. The tangential stress on 
the material is then given by t = u o, where u is the Coulomb friction coeffi- 
cient. The direction of t will be discussed below. With the expressions for the 
contact area A, and the projected areas A, and A, the forces on the material 
are now completely determined and can be decomposed to give the vertical 
and horizontal components F, and F,. The specific energy e and the scratch 
hardness can also be derived. 

We first consider the contact plane EAD, where A is the tip of the pyramid 
and DE lies in the surface of the material. From fig. A.2 it is seen that the 
displacement u, here taken equal to CQ, has a component HQ along the 
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contact plane AED and a component CH perpendicular to AED. Since the vector 
HQ isthesum ofthe vectors HB and BQ the angle o between HQ and BQ follows 
from (fig. A.3) 


tan o = HB/BQ = cot y, sin 8. — (AL) 


Fig. A.1. Top view of diamond geometry for the case of two planes (FCE and ECD) in 
contact. The displacement vector is indicated by x. 


SS 
X ; 


AED contact plane 
Fi CED x-y plane 


Fig. A.2. Cross-section of one quarter of a square pyramid (apex A, horizontal plane CED) 
showing the contact plane, AED, the x, y and z axes and the two components of the force 
on the material, F, and Fy. Note that here the friction force Fp is parallel to the 4 compo- 
nent in the contact plane, in contrast to fig. 24 where F, y has the opposite direction. Taking 
CQ = u, one has CB = u cos y and BQ = usin y. 


y 
Q 
j u, 
a Up 
H u; B 


Fig. A.3. Detail of fig. A.2 showing part of the contact plane, where u is the component 
of the displacement u. Since u, = u cos y sin 0 and uj = usin y one has tang = cot y sin 0. 
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If F, is the value of the perpendicular force, the friction force Fp is given by 
F, = u F,. The direction of F; has now to be decided upon. (F, is parallel 
to CH, i.e. u3). If the only motion is that of the indenter, the vector F; is 
parallel to HQ, i.e. uj. If, however, the material flows over the interface, 
the direction of the friction is opposite to the relative motion of material and 
indenter. An example was shown in fig. 24, where Fp points downwards, whereas 
the displacement u; is upwards. In cases where the material displacement vector 
v is not zero, the friction force will be parallel to the vector difference of u 
and v. This vector is different from the one drawn for u in fig. A.2, where 
u = HQ; consequently the angle o is no longer given by (A.1). Instead, one may 
take tan o = Kcot y, sin 8, where K may be positive (K = 1 for no flow) or negative 
(cf. fig. 24). The following formalism does not depend on the value of o. 
For the force components (F,, F,, F,) along x, y, z one has 


Fi: (F, cos 0, 0, — F, sin 6), 
Fi: (F; sin o sin 6, F, cos o, F sin o cos 0). 


For the contribution to F, one needs the z components, for F, the compo- 
nents along u. The transverse component F, is perpendicular to u in the 
horizontal plane: 


F, = —F, sin 0 + uF, sin o cos 6, (A.2) 
F, = F, cos 0 cos y, + u F, (sine sin 0 cos y, + sin y, cos o), (A.3) 
F, — —F, cos ô sin y, + y F, (—sing sin 8 sin y, + cosy; cos o). (A.4) 


For the second contact plane, EAF, one obtains similar expressions with 
ya = 90? — y,. The force ratio f= F,/F, for the total forces becomes, with 
y, = y, and definition (A.1) for tan o, 


cos 0 (sin y + cos y) + u [(sin? y + sin? 8 cos? y)* + (cos? y + sin? 0sin? y)*] 


sin 0 [2 — pcos 0 (tan? y + sin? 0)7* + (cot? y + sin? 0)7*] (A 5 


The area of each of the contact planes is 
c = d? tan 0[cos 0, (A.6) 


where d is the penetration depth of the scratching point. The projection of each 
contact area on the horizontal plane is 


A, = d? tan? 8. (A.7) 
The projection on the vertical plane of both contact planes is 


A, =4bd = d? tan 0 (sin y, + cos 7). (A.8) 
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Introducing c, the normal stress, by o = F,/A,, the forces can be expressed 
in o and y. 

The specific energy e, given by F,/A,, becomes, after summing over the two 
contact planes, with y = 7, i 


1 + u [(sin? y + sin? 8 cos? y)* -+ (cos? y + sin? 0 sin? y)*] 


i (A.9) 
cos 0 (sin y + cos y) 


The scratch hardness, H,, given by F,/24,, becomes 

H, = o {1 —(u/2) cos 0 [(tan? y + sin? 8)-* + (cot? y + sin? 0) *]). (A.10) 

Special cases of interest are y = 0? and y = 45°. For y = 0° one has 
e cos Ó + u (sin 0 + 1) 


(A.11) 

sin 0 (2 — u cot 0) 
e — o [1 + p (tan 0 + sec 0)], (A.12) 
H, = o (1 — 4u cot 0). (A.13) 


Although we started from two contact planes for y 4 0° the limit for y = 0° 
is ambiguous because it is not clear which is the second contact plane. It could 
well be that both side planes contribute each one half and then eqs (11)-(13) 
apply. Equation (12) for f may be compared to Hisakado's expression ?9) for 
the case of three pyramidal planes in contact: 


cos 0 -+ u (sin 0 + 2) 
di sin 0 (3 — u cot 0) 
By simply replacing the numbers 2 and 3 in this expression by 1 and 2 one 


obtains eq. (A.11). Changing the 2 and 3 into 0 and 1 the formula for one 
contact plane becomes 


(A.14) 


cos 6 + usin 0 


sin 8 — u cos 8 ' 


which is equal to that given in the text (eq. (7)) if one changes the direction 
of the friction by using —p. 
For y = 45? eqs (A.5), (A.9) and (A.10) become 


cos 6 + u (1 + sin? 6)* 


TEARS ATT. I ADR UTIQUE? A.15 

n 2* sin 0 [1 — 4 cos 0 (1 + sin? 0)7*] (A13) 
1 1 + sin? 0)? 

c. DE (A.16) , 


cos 6 


H, =o [1 — u cos 0 (1 + sin? 0)*]. (A.17) 
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Only for y = 45? is the total transverse force, F4, equal to zero in a model 
where two contact planes are considered. If the friction force is not taken along 
uj, One must use a different o value. Instead of (A.15) one then obtains 

cos 0 + u (1 + Ksin? 0) (1 + K? sin? 0)7* 

f= See ee eee : (A.18) 

2* sin 0 [1 — K cos 0 (1 + K? sin? 0)7 t] 
For negative K the f value is reduced from its value for u = 0, so for com- 
parison with experiment K has been put equal to unity (cf. sec. 5.1). 
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NEW RESULTS IN ONE-PORT SYNTHESIS 


by V. BELEVITCH and Y. GENIN 


Abstract 


One-port synthesis is treated by a combination of Oono-Yasuura and 
state-space methods, thus yielding a number of new results different 
from Brune and Darlington, and a deeper insight in the problem. The 
invariant numbers of inductances and capacitances in all minimal 
realizations, and the minimum number of resistances, are explicitly ob- 
tained by physical arguments. The large class of impedances accepting 
a minimal realization with only two resistances is characterized and 
an easy synthesis process is presented. Continued-fraction expansions of 
lossless 2-port and 3-port hybrid matrices are used to obtain state-space 
realizations with the minimum number of elements. 


1. Introduction 


Any realization of a prescribed impedance of degree m must use a total of 
at least m reactive elements. The Brune synthesis is minimal in this respect, but 
the problem of finding all minimal passive reciprocal realizations for a prescribed 
positive real function is still unsolved. The Oono-Yasuura (OY) method !) 
yields, in principle, all realizations but is only an existence theorem involving 
an intermediate non-constructive step (discussed in sec. 5 of this paper) in its 
proof. The state-space (SS) synthesis also yields all realizations but generates 
a universe of unphysical realizations from which the physical ones can only be 
sorted out numerically a posteriori ?). Both the OY and the SS methods have 
been developed for the general (even non-reciprocal) n-port case; by restricting 
our investigation to the reciprocal one-port case and by combining both ap- 
proaches, we obtain a number of new results. 

Let 4 and y denote the number of inductances and capacitances, respectively. 
For a minimal realization, one has 


A+y=m. (1) 
In the OY synthesis it is proved that the difference 
y—A-t Q) 


is also invariant for all reciprocal minimal realizations, so that A and y are 
separately invariant ?). Moreover, an algorithm for computing (2) is implicit 
in the OY process. The invariance of the reactance signature (2) was later 
proved by the SS process *), from which Forster *) deduced an explicit ex- 
pression of t in terms of the Cauchy index of the prescribed impedance. 

It is known from the OY synthesis, and from examples treated by the SS syn- 
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thesis 5), that the number o of independent resistances is generally not invariant 
for minimal realizations. In the SS synthesis, a one-port is realized as a constant 
(m + 1)-port closed on m unit reactances, so that one has 


e<m+l. (3) 


In particular, the upper bound in (3) is attained for general RL and RC one- 
ports. At the other extreme, all minimal realizations of lossless impedances 
(Foster functions) give o = 0. Also, all minimal realizations of impedances 
z(p) having all their transmission zeros, i.e. poles of z or roots of 


zt+2z=0 (4) 


(where z, is z(—p)) on the j-axis, give 9 = 1. This is the case where the Brune 
and the Darlington syntheses coincide, because there are no intermediate resist- 
ance extractions in the Brune process. Although the Darlington synthesis 
always yields ọ = 1, it is generally not minimal; it is minimal when all roots 
of (4) outside the j-axis have even multiplicities 7), but it then differs from 
the Brune synthesis which generally yields o > 1. 

Some indication on the minimum number ọ of resistances necessary for the 
realization of various classes of impedances results from a count of the number 
of parameters. A general rational function of degree m has 


N=2m+1 (5) 


parameters. By contrast, a Foster function (which is odd) has only m parameters. 
An impedance realizable with @ = 1 has the number of parameters of a loss- 
less reciprocal 2-port of degree m which is 9) 


a l 3m[2 + 1 | for E. (€ 


T lam + 1/2 “Todd 


and is situated midway between m and (5). For an impedance realizable with 
@ = 2, the realization is a lossless reciprocal 3-port of degree m closed on two 
resistances. The number of parameters of the 3-port?) is 2m + 2 but an 
orthogonal transformation on the terminating resistances, which depends on 
one arbitrary parameter, reduces the number of parameters to (5). As soon as 
@ 22, the number of parameters in the realization exceeds the number (5) 
of parameters in the specification. Impedances with o — 2 thus form a class 
of particular interest: they have the same number (5) of parameters as general 
impedances (which is not true for o — 2) and their realizations allow no degree 
of freedom (which is not true for realizations with o 2). 

A synthesis process is canonic for a class of impedances depending on N 
parameters when it yields networks containing at most N elements. Since a 
minimal realization uses m reactances, there remains, by difference from (5), 
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ne 


a total of 
otv=m+1 (7) 


constant elements, where v is the number of transformer ratios. A realization 
is in state-space form when it consists of an ideal transformer network with 
inductances connected at the series ports, capacitance at the shunt ports, the 
external ports and the resistances being arbitrarily distributed between the two 
kinds of ports. The Brune synthesis, and the Darlington synthesis when it is 
minimal, are canonic but not in state-space form. 

In sec. 2 of this paper we give a simple physical proof of Foster’s expression 
for (2) and obtain some restrictions on o. In secs 3 to 5 we review and complete 
the OY synthesis for one-ports and obtain an explicit expression for the mini- 
mum value of o. In secs 6 and 7, the class of impedances realizable with o = 2 
is fully characterized and an explicit OY synthesis is obtained. In secs 8 to 10 
we obtain canonic state-space structures for those one-ports which are realizable 
with o < 2 resistances. 


2. The reactance signature 


We designate by 
s = hjg (8) 


the reflection coefficient of the prescribed impedance; (8) is bounded real and 
irreducible of degree m, with g monic. If s has a minimal realization containing 
o resistances, obtained by any method, that realization can be considered as a 
lossless reciprocal n-port with 


n-po-dl (9) 


whose last ọ = 1 — 1 ports are closed on unit resistances. The scattering 
matrix S of the lossless n-port is symmetric paraunitary and its entry 5, , is (8). 

When the one-port of reflectance (8) is closed on a unit resistance the tran- 
sients of the resulting closed network correspond to the poles of (8), i.e. to the 
zeros of g. For the n-port of matrix S, the wave equation b = Sais equivalent 
to a = S. b and becomes 


S. b—0 (10) 


in the absence of excitation. Consequently the zeros of det S, must coincide 
with the zeros of g, and one has 


det S = + gg. (11) 


If at some zero of g, (10) accepts a solution vector b whose first entry is zero, 
the corresponding transient decays only in the last n — 1 resistances and is 
inobservable and uncontrollable from the input. Equivalently, the n-port of 
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matrix S then contains an all-pass (n — 1)-port at its output °), which is not 
allowed for a minimal realization. Consequently S. has rank n — 1 at every 
zero of g, else several independent solutions 5 of (10) would exist and com- 
binations with b, = 0 could be formed. 

The matrix S. is regular real and symmetric for all real negative p. For p fixed, 
it can thus be diagonalized by an orthogonal transformation 


Q's.Q— A. (12) 


Moreover A is non-singular with S., except at the zeros of g, where A has only 
one zero entry because rank S. = n — 1. Computing 47+ from (12), replacing 
S~! by S and solving for S, one obtains 


S=QDA-1Q'. (13) 
Now, (8) is the entry 11 of (13), ie. 


w? 0,5? 

sS n + i cvs (14) 
In the following we distinguish the notation sign M for the signature of a 
matrix of dimension n (which is an integer between —n and n) from the usual 
notation sgn s for the sign (+1, —1 or 0) of a scalar. By (12), sign S. = sign A 
is constant on the negative real axis between two consecutive zeros of g. The 
signature variation when p passes through a zero of g is identical to the varia- 
tion of sgn 4, where A, is the only eigenvalue which vanishes at that zero. 
Since a zero of g corresponds to a pole of (14), the principal part gives 
sS = cj ?[AÀ, and one has sgn s= sgn A, In consequence, the variation of 
sign S, when p passes a zero of g is identical to the variation of sgn s. By 
adding the contributions of various zeros of g (poles of s) in an arbitrary 
interval « <p <f of the negative real p-axis, and using the classical nota- 
tion 1°) for the Cauchy index of a rational function, one obtains 


3 [sign S.(f) — sign S.(a)] = I a S(p). (15) 


It is known since Oono-Yasuura that the invariant (2) of a lossless n-port 
of scattering matrix S is given by 


T = $(80o — a) (16) 


where og (Cœ) denotes the trace of S at p = 0 (p = œ). This is because at 
these extreme frequencies the lossless n-port reduces to an ideal-transformer 
network whose rank determines the number of independent capacitances 
(inductances), and because S has only + 1 eigenvalues so that the network 
rank is simply related to the trace of S, which is equal to its signature. By 
making « = —oo, f = 0 in (15) one thus obtains 


t= I9, S(p). (17) 
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Because s is analytic for p — 0, the upper limit 0 of (17) can be changed 
into +œ. Finally, since 


z—1 (18) 
S = 
z+1 
and since only the sign variations of (18) near its poles (where z = —1) are 
relevant, one may as well replace z by —1 in the numerator, to obtain 
I9. : (19) 
t= Io —-. 
X z+1 


If z is finite at infinity, the known relation ++) between the indices of two 
inverse rational functions gives 


t = IZo (Z + 1) = ISo Z (20) 


which is Forster’s result 5). 

The result (17) is slightly more general than (20). Moreover, the intermediate 
result (15) yields additional information on the minimum number ọ of resist- 
ances. Since the signature of a matrix of order n is comprised between —n and n 
the maximum variation of (15), when « = —oo and f varies from —oo to 0, 
is at most n and one has 

nzyu QD 
where 
w= max I^,s— min Ios (22) 


—-o<f<0 -o«fixO0 


is the maximum variation of Zf œ s in the interval —oo < f < 0. This maxi- 
mum variation (22) cannot be smaller than the modulus of the variation (16) 
on the whole negative real axis, so that one has 


pz. (23) 
Finally, (9), (21) and (23) yield 
2|d|-1. (24) 


In particular, for RC or RL one-ports of degree m one has |z| = m, and 
(24) gives the known result o > m — 1. At the other extreme, for a lossless 
one-port (p = 0), (24) gives |z| < 1 so that A and y differ at most by 1. Al- 
though (24) sets a lower bound on o, this bound is not strict, in the sense that 
a realization satisfying the equality in (24) does not necessarily exist. As a 
counterexample, (24) is satisfied by o = 0 for |-| = ], but it is not true that 
every one-port with ly— 4| = ] is lossless. A strict lower bound on o, more 
severe than (24), will be derived in sec. 4. 
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3. The OY synthesis 


In the OY synthesis, one starts from the prescribed reflectance (8) and solves 
the equation 


ggs — hh. = ff. (25) 


equivalent to (4), to determine the polynomial f with positive leading coefficient 
containing the zeros in Re p < 0, and one-half of the zeros on the j-axis, of 


(25). The matrix 
ids b Fg | Q6) 
Jde —fgf 


is symmetric paraunitary and regular in Re p > 0 because the zeros of f on 
the j-axis cancel in the factor f/f of its 22-entry. The lossless reciprocal 2-port 
of matrix (26) closed on a unit resistance at its output is therefore a realization 
(the so-called regular realization) of the prescribed reflectance (8). This realiza- 
tion is, however, generally not minimal because 


det S, = — gf g (27) 


is not (11) unless f is even or odd. 

. Even or odd factors of f cancel in (27). On the other hand, if f contains a 
square factor a, the factor a.?/a? of (27) can be deleted by multiplying by aja. 
the last row and column of (26), which corresponds to the extraction of a 
physical all-pass from the output. When all excess factors in (27) can thus be 
cancelled, the result coincides with Darlington's synthesis, which is then mini- 
mal. Since transmission zeros on the j-axis can be extracted by a Brune preamble 
and since double zeros can be extracted by a Darlington preamble without 
introducing excess degree, the true OY process only sets in when all transmis- 
sion zeros are simple and outside the j-axis (including infinity). The polynomial f 
Is then Hurwitz, square-free and of degree m, and this will always be assumed 
in the following. 

The first step of the OY process consists in replacing (26) by 


S, = LE. 28 
zh xl (28) 


where S, is a bounded real symmetric paraunitary matrix of arbitrary dimension 
such that 

det S, = + f/f (29) 
so that - 
det S, = + (A/f)? gg (30) 


contains the square of the excess factor of (26) so as to allow an all-pass ex- 
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traction. The all-pass is characterized by a bounded paraunitary (but not sym- 
metric) matrix U such that 
det U = f/f. (31) 
The result after extraction 
S = U"! S, UT! = U, S, Us’ (32) 


is then a minimal realization if the extraction leaves S,, invariant and makes 
(32) regular in Re p — 0. 

It is known in the OY process that the regularity condition for S sets certain 
restrictions, in addition to (29), on the submatrices S, acceptable in (28). A new 
proof of these restrictions will now be described. 

. Firstly, if f and hh. have a common factor, this factor divides gg. by (25); 
since f is Hurwitz, this factor divides g; since (8) is irreducible, this factor does 
not divide /i in hh, and thus divides A+. Consequently f and ^ are coprime, and 
any common factor of f and hh, is common to f, g and h. and simplifies in the 
entry 22 of (26). Since f is square-free, every zero of f thus always corresponds 
to a pole of degree one of (26). By (29), every zero of fis also a pole of degree 
one of S,. Finally a zero of f is a simple pole of degree two of (28) and the 
corresponding residue matrix, say H, has rank 2. Owing to (31), a zero of fis 
a pole of degree one of U, so that U^! = U; has nullity one at such a zero. 
Since the pole of S, must disappear in (32), one must have U. H U,' = 0, 
with all real matrices at a real pole. If H were positive or negative definite, 
a decrease from rank 2 to rank 0 would be impossible by a congruence trans- 
formation of nullity 1; hence H is indefinite of rank 2 and thus has zero signa- 
ture. In conclusion, at a real zero of f, the residue matrix of S, must have 
rank 1 and a signature opposite to the sign of the residue of the entry 22 of 
(26). Since (25) reduces to 


EEs — hh. (33) 


at a zero of f, the ratios //g, g./h and h/g. have identical signs and the condi- 
tion reduces to 
h f. 
sign res matr S, — sgn res — (34) 
gf 
at all real zeros of f. 
Condition (34) has a simple physical interpretation. The realization (26) is 
not minimal because the zeros of fin (27) introduce uncontrollable inobservable 
states. Now, a reciprocal all-pass has a scattering matrix of the form 


MM 


of square determinant and zero trace, so that all its transient modes are double 
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and all its real modes are pairs with opposite reactance signatures. Conditions 
(29) and (34) precisely generate in (28) additional inobservable states of the 
appropriate type. 

Since the only poles of h f./g. f£ on Re p « 0 (including 0 and oo) are the 
zeros of f, (34) is equivalent to 


hf. 


3 [sign S,(8) — sign S,(a)] = 72 
gf 


(35) 


on any interval « <p < f of the negative real axis. Finally, since neither f. 
nor g, have zeros on p < 0, these polynomials can be replaced by the signs 
of their leading coefficients which are identical because f and g have identical 
degrees and positive leading coefficients. Consequently the right-hand side of 
(35) simplifies to 
h 

I. (36) 
J 
4. The number of resistances 


If d is the dimension of the submatrix S, of (28), the dimension of (28) is 
n — d + 2 and the number of resistances is 


o=d+1 (37) 


by (9). In this section we investigate the range of possible values of d for a 
paraunitary matrix S, satisfying the residue condition (34). 

From the Kalman representation of a matrix S, of dimension d > m, it 
results that the normal rank of S.(p) — S,(co) is at most m. This means that 
there exist d — m linearly independent real constant vectors u such that 


S, u = S,(00) u. (38) 


Premultiplying (38) by its transpose and using S,?(co) = 1, one obtains 
u’ (14 — S?) u = 0 for all p, which proves that S, has d — m constant eigen- 
values equal to + 1, so that the d-port of matrix S, can be realized with only 
m resistances. Consequently only solutions of dimension d < m need be con- 
sidered, thus confirming (3). 

For d — m, a solution for S, is the direct sum of diagonal entries 


+ (p — eo) + a) (39) 


corresponding to real zeros —a, of f, and of diagonal blocks 


1 ?.Lo?-—a? 2a,0 
j i i 101; ] (40) 


(p + oj + w? 


2 i @; —p? — oj + aj? 
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of determinant 


(p? + c? — 0,7)? +4 au? ej? (a, — py? + w? (41) 
F+ (+p? coe) 


corresponding to pairs of complex conjugate zeros —a, + jw, of f. The resi- 
due conditions (34) can always be satisfied since each entry (39) contains a 
separately arbitrary + sign. 

Solutions with d « m are obtained by replacing some or all blocks (40) by 
scalars of the form (41) and then contracting the resulting diagonal matrix by 
choosing for some entries arbitrary products of factors (39) and (41). Since the 
contraction of real factors (39) reduces the number of arbitrary signs, a con- 
flict may arise in the residue conditions, and this imposes a lower bound on d. 
If one denotes by v the maximum variation of the Cauchy index (36) when 
æ is —co and when f varies from —oo to 0, one must have 


dev (42) 


by analogy with (21). In contrast with (24), the inequality o > v + 1 resulting 
from (42) and (37) is strict for v # 0: it is easy to construct solutions of all 
intermediate dimensions m zd zv by contracting the solution of dimen- 
sion m or by expanding a solution of dimension v, because the residue con- 
ditions do not interfere with the process as long as the upper and lower bounds 
on d are satisfied. The case v = 0 is special and occurs when f has only complex 
zeros; S, can then be taken as the scalar + f/f, which corresponds to d = 1, 
Q2. 

All the above results only hold when f is Hurwitz, square-free and of degree 
m — deg g, owing to the Brune and Darlington preambles discussed after (27). 
It is easily verified that the OY process works as well without the Darlington 
preamble, so that f need not be square-free. By contrast, transmission zeros on 
the j-axis cancel in the entry 22 of (26) and in (29), so that the realization with 
the largest number of resistances no longer attains the upper bound ọ = m + 1 
of (3) but only o = m' + 1, where m’ is the degree of the simplified rational 
function f/f. Similarly, v of (42) must be replaced by v', the maximum varia- 
tion of the Cauchy index (35) after simplification of common factors in f/f, 
so that the resulting rational function generally differs from (36). 


5. 'The general bordering 


In the preceding section, a number of particular solutions (diagonal or almost 
diagonal) have been obtained for S,. For the complete solution of the equiv- 
alence problem one must, however, find all matrices S; of prescribed deter- 
minant (29) and satisfying the residue conditions. Physically, the problem con- 
sists in finding all lossless d-ports, with d < m, having prescribed transient 
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modes and prescribed reactivity indices for the real modes, when closed on 
d unit resistances. In the state-space description, the resulting closed network 
of degree m is characterized by a 0-symmetric hybrid matrix of dimension m 
closed on m unit reactances, and the class of equivalent networks is obtained 
by the similarity transformation !?) 


F=T-!HT (43) 
where T is 0-orthogonal, i.e. satisfies 


T'OT=9 (44) 


and preserves passivity, i.e. leaves F + F’ non-negative definite with H + H’. 
Moreover the number d of resistances contained in the closed network is the 
rank of F + F'. : 

At this stage the following problem arises: given a closed network defined by 
a constant hybrid matrix H closed on m reactances, find the scattering matrix S 
of the same network considered as a lossless d-port, with d = rank (H + H’), 
closed on d resistances. Since the d unit resistances remain invariant by an 
orthogonal transformation, S is only defined within such an arbitrary trans- 
formation. Moreover, since H + H' is the direct sum of a resistance and a 
conductance matrix, it can be diagonalized by an orthogonal transformation 
(itself a direct sum) leaving invariant the unit reactances. If this orthogonal 
transformation is applied to H, producing 


HH, = 2 HQ, (45) 


the resulting closed network takes the form of fig. 1. This network can also be 
considered as the m-port of fig. 2 whose series (shunt) ports are shorted (open) 
as indicated. Now the scattering matrix of a lossless d-port of some hybrid 
matrix H, closed on unit resistances is 


S = 0 — 20 (H, + 1,)7'. (46) 
In (46), H, + 1, is the hybrid matrix of the augmented d-port of fig. 3, and 


Fig. 2. 
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the closed network is obtained by shorting (opening) the series (shunt) ports 
as indicated. It is now clear that the d-port of fig. 3 results from the m-port 
of fig. 2 by inserting ideal transformers so as to normalize to unity the non- 
zero resistances r (conductances g) internal in H, -- p 1m (as obvious from 
fig. 1) and so as to short (open) the m — d ports corresponding to zero resist- 
ances (conductances). If 4? denotes the diagonal form H, + A,’ of H+ H', 
it then results that the inverse hybrid matrix (H, + 1,)^* of the d-port of 
fig. 3 is simply the inverse hybrid matrix (H, + p 1,,)~+ of the m-port of fig. 2 
transformed by 4... ^, with m—d rows and columns of zeros deleted in 
the result. By (45) and (46) one thus has 


S—[0—0 A(Q HO + pl)! A] (47) 


where the subscript indicates that the rows and columns corresponding to the 
zero entries of A are deleted, and where 


Q (H+ H) Q =A? (48) 


with A diagonal and 2 orthogonal. 
If one takes for H a direct sum of diagonal entries «, for the real zeros 
—a, of f and of diagonal blocks 


a; Qi 
| ] (49) 
—0, CA 


for the pairs of conjugate zeros —&; + jo, of f, by applying (47) with d = m 
and 2 = lm, one obtains for S the direct sum of entries (31) and blocks (40) 
discussed in sec. 4. The general solution of the bordering problem is then 
deduced by replacing that matrix H by (43) and by applying (47) and (48) 
with H replaced by F. The solution is, however, only implicit, because the 
necessary and sufficient conditions for passivity preservation in (43) are not 
known. 
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6. Synthesis with two resistances 


In accordance with the discussion at the end of sec. 4, a synthesis with two 
resistances is possible in two cases: 

(a) v — 0: f has only complex zeros. 

(b) v = 1: the signs of the residues of A f./g. f in (34) alternate at consecu- 
tive real zeros of f; since f. and g, have a constant sign on p < 0, this is equiv- 
alent to the condition that consecutive (simple) real zeros of f must be separated 
by an even number (or zero) of zeros of h. 

When @ = 2, the submatrix S, of (28) reduces to the scalar + fi/f by (29) 
and the ambiguous sign is arbitrary in case (a) but determined by the residue 
condition (34) in case (b); this condition requires + f/f and h f.[g. f to have 
identical signs, thus forcing 


+ = sgn Bd (50) 
ge. 

at all real zeros of f, which is clearly compatible with the condition stated in (b). 

To complete the synthesis, it remains to determine the transformation matrix 

U making (32) regular in Re p — 0. Since U must leave port 1 invariant, it 

is the direct sum of unity with a bounded paraunitary matrix of dimension 2 

of determinant (31). Such a matrix is of the form (within irrelevant polarity 
reversals) 


1 0 0 
U=|0 zif olf (51) 
0 Mf = xd. 
with 


tte + dé. = ff. (52) 
The resulting matrix (52) is . 


hig = xdg dle 
S — | xg S22 S23 (53) 
$/g S23 S33 
with 
S22 = (+ fe? g — he ye") fg, (54) 
S23 = HE GLE + $ hf (55) 
S33 = CE x? 8— P h)/fg. (56) 
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The entries of (53) are analytic in Re p > 0 if the numerator polynomials 
of (54), (55) and (56) are divisible by f, i.e. if one has 


bs = & xy» (4 4/g)*, (57) 
put = F x à (e/g), | (58) 
X = M $ GE hjg) (59) 


at the zeros of fa, where the symbols e; = + 1,4, = + 1 have been introduced 
to emphasize the fact that the determinations of the square roots in (57) and 
(59) are separately arbitrary at every zero of f.. Note that, at all these zeros, 
the rational function + /./g occurring in all expressions is finite because g is 
Hurwitz and non-zero because f and h have been proved coprime in sec. 3. 

The product of (57) by (59) yields (58) iff e, 3, = —1, so that one must 
replace (59) by 


X = —& $ (+ hg)* (60) 


at the zeros of f., and (58) may then be dropped. The remaining conditions 
are thus (57) and (60) with identical determinations for the square roots. 

Conditions (57) and (60) define a linear interpolation problem +°) for the 
rational function y/$. Since there are 2m homogeneous relations for 2m -+ 2 
undetermined coefficients, there remain two degrees of freedom. In any case, 
there is at least one non-zero solution (y zÆ 0, $ Æ 0). Now, the product of 
(57), by (60) with both sides permuted, yields, after cancellation of a common 
factor, 


—$$. = xx« (61) 


at the zeros of f}, so that (52) is automatically satisfied for all non-zero solu- 
tions y,¢ if one of the available degrees of freedom (a common factor in x 
and $) is consumed to match the leading coefficients in (52). Note that the 
resulting rational function y/ó may be reducible, but its degree (unsimplified) 
is necessarily m. The remaining degree of freedom corresponds to the fact that 
the submatrix of (51) corresponding to ports 2 and 3 can be multiplied by an 
arbitrary constant orthogonal matrix without affecting the synthesis. This free- 
dom is suppressed by imposing some arbitrary value to y/¢, for instance at 
infinity. With any such convention, the solution y,9 is unique for a given set 
of signs for the e, in (57) and (60) and for a given sign of (50) when the latter 
is free. The only restriction on the signs of e, is that complex conjugacy must 
be respected at conjugate zeros of f+, so as to yield real polynomials y and 4. 
Consequently, a discrete set of solutions is obtained. 

Owing to (33), one may replace h/g by g./h in (57). Taking the paraconjugate 
of the result and solving for y yg one obtains 


X Vs =e,¢ V+ he (62) 
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at the zeros of f. Note that, in contrast with (57), division by ys is not allowed 
in (62) if g, h. and f have a common factor. In any case, (62) is sufficient to 
prove that the numerator of (56) is divisible by f. Similar manipulations on 
(58) and (59) show that not only f, but also f, cancels up and down in all 
expressions (54)-(56), so that g is the common denominator of all entries of 
(53), as expected for a minimal realization. 

The above synthesis, when it works, is much simpler than the Brune syn- 
thesis, since the only algebraic equation to be solved is (25). Although the 
remaining interpolation problem is linear, closed-form solutions cannot gener- 
ally be obtained because x and ¢ involve the irrational values of (+ /,/g)*, so that 
the cancellation of ff. in (54)-(56) must be achieved numerically. For an imped- 
ance of degree one, which can be handled in closed form, the result coincides 
with Brune's synthesis; the latter then only involves linear equations, whereas 
the OY synthesis is exceptionally heavier because (25) is a second-degree equa- 
tion. 


7. 'The rational case 


In this section we obtain closed-form solutions for the case where + /./g 
is the square of a rational function. Since 4. and g may have common factors 
we set 

h, = +u? q; g-—wq, (63) 
hence 
h= +? qe. (64) 


Condition (50) is then automatically satisfied. Moreover, (25) becomes 


qq«[(vv)? — (uu) ] = fa. (65) 
Conditions (57) and (60) require 
gs = E; X» UV; X = —E $ usv (66) 


at the zeros of f.. One of these relations is satisfied if either y/ġ or $.[xy. is 
proportional to w/v. We take the first choice, for the other one merely inter- 
changes 7,¢ with $, y. in all results and simply permutes ports 2 and 3 in 
(53). We thus set 


x80u; ó—60v (67) 

and (66) becomes 
0 (1 + &) — 0, (68) 
8. (vv. — £; uu) = 0 (69) 


at the zeros of f.. At the zeros contained in the factor vv. -- uu. of (65) it is 
advantageous to adopt e, — —1, for both (68) and (69) are satisfied without 


^5 
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extra cost in @ or 0.. At the zeros of f. contained in the factor vv. — uu. of 
(65), (68) and (69) are satisfied either by 


€, = —1; 0, = 0 (70) 
or by no 
820; £j — 1, (71) 


so that the corresponding factors can be arbitrarily distributed between @ and 6., 
which means that Ó must be divisible by f, an arbitrary solution (not necessarily 
Hurwitz) of 

vU. — Uus = fi fis (72) 
Finally, f. contains one half of the zeros of the factor qq. of (65). Since f is 
Hurwitz, and since q is Hurwitz as a factor of g in (63), these zeros are the 
ones of q». At such zeros the factor vv. — e, uu. of (60) does not vanish spon- 
taneously for e, = + 1, so that one must adopt 0, = 0 in (70). Consequently 
0. must be divisible by qs. Finally, the solution is 


0 — 4f. (73) 


Using (63), (65), (66), (72) and (73) one brings the matrix (53) into the 
simple form 


wq u fis qs f 

v?q vg v 
UJis Ws K + Us 

E fisa ^ q Edd (14) 

vq v?q v 

fi UA 

2 pL 0 

v v 


The particular case of (74) corresponding to g = 1 and the upper signs has 
been obtained by a different method in a previous paper !^). From that ref- 
erence it results that the lossless 3-port of matrix (74) can be realized as the 
all-pass 4-port of scattering matrix 


IL 75 
E d ya 


1 u fi 
So = =| | (76) 
v cfe F Us . 


whose port 3 is closed on a reactance of reflectance + q./q. 
Because of the arbitrariness of the factorization (72), (74) defines a discrete 
set of solutions. Because the degree of f, is at most one half of the degree 


where 
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of f, not all solutions are thus obtained: additional solutions with less system- 
atic choices for the signs of e, are not expressible in closed form. . 


8. State-space structures 


For a one-port containing o resistances, a realization is in state-space form 
and canonic if the corresponding lossless (o + 1)-port is an ideal-transformer 
network with inductances (capacitances) connected at the series (shunt) ports 
and such that, after termination of some additional (series or shunt) ports on 
@ resistances, the total number of components becomes (5) for o = 2, (6) for 
o = 1 and m for ọ = 0. Solutions for o = 1 and o = 2 will be successively 
obtained as generalizations of the known solutions for o = 0, discussed first. 

For a lossless one-port characterized by its impedance, any state-space 
realization is a transformer network of some ratio matrix N whose first series 
port (for instance) is the external port, the remaining series ports being closed 
on unit inductances whereas all shunt ports are closed on unit capacitances. 
Separate orthogonal transformations on the inductances and on the capaci- 
tances leaving the port invariant permit to reduce to zeros many entries of N.- 
The two extreme patterns (where crosses denote non-zero entries) are 


P L L L 
C X X 
C X x (77) 
C Xo X 
and 
P L L 
C| X X X X 
C x (78) 
C x 
C x 


Either pattern is reached by applying successive orthogonal transformations of 
order 2 as in the Givens process 15) so as to reduce systematically to zero the 
lower entries of the first column, then of the first (second) row, then of the 
second column, then of the second (third) row, ... in such a way that the 
zeros already obtained are unchanged by subsequent transformations. The lad- 
der structure resulting from (77) coincides with Cauer’s synthesis (continued- 
fraction expansion) whereas (78) yields the Foster synthesis (rational-fraction 
expansion), but a large variety of mixed expansions (all having a total of m 
non-zero entries in N) can be generated by mixed strategies. 

In contrast with all other reactance synthesis procedures, the Cauer synthesis 
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of (77) only involves linear equations. Our extensions to o = 1 and 9 = 2 
will therefore be generalizations of (77). On the other hand, a state-space de- 
scription of an n-port only exists if the specified hybrid matrix (including imped- 
ance or admittance matrices as particular cases) is finite at infinity, but at 
least one such matrix exists by Zuidweg's theorem !5). For o = 0, the alter- 
native is only between impedance and admittance, but the variety of hybrid 
matrices is larger for ọ = 1 and o = 2. 

For o — 1, if the lossless 2-port is defined by its impedance matrix, both 
ports are series ports and the application of the Givens process yields the 
pattern 


(79) 


which is tridiagonal, whereas (77) was bidiagonal, because one more port 
remains invariant. If the 2-port is characterized by its hybrid matrix, the 
resulting pattern is 


` 


P, x x 

C; X X X : (80) 
C; X X X 

C3 X X 


Finally, the admittance description gives the dual of (79). 

The number of non-zero entries in (71) and (80) depends on the reactance 
signature T, restricted by [e| < 2 as a result of (24) with ọ = 1. The resulting 
number is given in table I, where N’ is (6) and where I denotes an impossibility 


TABLE I 
T —2 —i 0 1 2 
m even odd even odd even 
(79) I I N'—1 N' N'—1 
(80) I N' N’ N’ I 


arising because some inductances or capacitances are no longer independent. 
For the pattern (80), the 2-port reduces to an ideal transformer at infinity and 
this forces cœ = 0, thus making t = + 2 impossible by (26) since og is 0 
or + 2. For the pattern (79), there is at least one transmission zero at infinity 
(C, shorts P,) and this makes v = 2 possible (the dual pattern permits t = —2) 
but the number of parameters then reduces by 1 (except for odd m where this 
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transmission zero is not a restriction). In any case the whole class of networks 
depending on N’ parameters is realizable by (80). By terminating P; on a unit 
resistance and denormalizing all components, one reduces the ratio matrix to 


R L L, Ls 


(81) 


and the realization is fig. 4, which is thus a canonic state-space structure equiv- 
alent to Darlington's realization. 
For p = 2, we omit the analogue discussion and simply justify the form 


R, R L L L 


(82) 


. generalizing (81) and having the canonic number of parameters ọ = 2, 
v = m — 1l. At infinity, a general lossless 3-port reduces to a 3-winding trans- 
former with all finite non-zero numbers of turns, or to its dual. Adopting the 
first alternative, one characterizes the 3-port by a hybrid matrix of the form 


li vy 
v| =H) i, (83) 
v3 i3 


where the external port 1 is shunt, whereas the series ports 2 and 3 are closed 
on resistances. This accounts for the row and column markings in (82). The 
entry 11 of (82) has been reduced to zero by an orthogonal transformation 


Fig. 4. 
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on the resistances whereas tbe general quadridiogonal pattern results from 
Givens' process. Finally, the unit entries are produced by denormalizing the 
resistances and reactances. 


9. Hybrid matrices of lossless 2- and 3-ports 


In order to deduce a state-space realization from the OY synthesis (which 
only yields the bordering matrix S), it remains to compute the required hybrid 
matrix H of the lossless n-port and to generalize Cauer's continued-fraction 
expansion to n-port hybrid matrices. 

The impedance matrix Z is related to S by 


gi eos) ue. (84) 


and any other hybrid matrix H (including admittance) is given by (84) with S 
replaced by 0 S, where 0 is diagonal of entries + 1, the negative entries 
affecting those port variables whose nature is modified in passing from the Z 
to the H description. In the computation of the determinants appearing in (84) 
many simplifications occur when S is para-unitary, owing to Jacobi's theorem 
on adjoints, and this allows one to obtain simple expressions for all entries of 
Z for dimensions 2 and 3. In order to deduce similar expressions for H one 
must, however, write the relations between Z and S without making use of 
the symmetry of S (i.e. keeping distinct symbols for S,,; and S;;) because the 
relations must be usable for the non-symmetric matrix 0 S. 

In the following formulas we denote by S,,; = s,,/s the entries of S, s being 
the monic polynomial defined by 


det S = F ss. (85) 


The ambiguous sign in (85) has been noted so as to agree with the usual con- 
vention for reciprocal 2-ports where 


1 rA 
S = =| f | (86) 
glf Fh 
hh. + f? = gg. (87) 
yields det S = F g./g, the upper (lower). sign holding everywhere for f even 
(odd). 
For a 2-port, the only simplification occurring in (84) results from using 
(85) in 


with 


det (1; — S) = 1 — S11 — S22 — det S 


and the result is 
1 $-E5$.4d-5,,—5 2s 
z2 | F 11 22 512 ] . (88) 


S E Se — 541—522 2521 S F Se — S11 + 522 
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The formula for the hybrid matrix expressing i,,v in terms of v,,i, results 
from (88) by using 0 = diag (—1, 1}, i.e. by first changing the sign of s,, and 
5,2 everywhere, and then replacing all polynomials by their special values of 
(87). This gives 
1 *— h + h. —2 
-— — D Fg f | l (89) 
gdg--h-h. 2f gTg2 T hcEh. í 
For a 3-port, the Jacobi relation for minors of order 2 gives simplifications 
such as 


(also for non-principal minors), so that one has 
Siis + S22 + S33 
det (13 — S) = 1 — S1; — S22 — Saa + AR das. 
det S, 


The final expressions for the entries of (89) are 

S + S11 — 822 — $33 F (S + S11 — S22 — S33) 

$— S11 — 523 — S33 + (S — $11 — $22 — S33) 
2(512 F Sais) 


$— S11 — 822 — $33 + (S — S11 — $22 — $33) 


Zu = 
(90) 
Zi2 = 


and the remaining expressions are deduced by circular permutations. 


10. Continued-fraction expansions of hybrid matrices 


Since the hybrid matrix of a lossless n-port is para-odd (H + H = 0) but 
not symmetric, it has all the analytical properties of the impedance matrix of 
a lossless non-reciprocal n-port, and Newcomb’s extension +7) of Cauer’s syn- 
thesis applies, but additional consequences result from the 0 symmetry of H. 

If H is identically singular it is (within a symmetric permutation) of the 


form 19) 
H, Ho C 
| 0 o | (91) 
C' Ho C' Ho, C 
with Ho a non-singular submatrix of H and C a constant matrix. By writing 


that (91) is 0-symmetric with 0 partitioned into 0, and 0; conformally with 
(91), one obtains, among others, the relations 


Ho 0, = 0, Ay; Ho C0,— 0, Hy C 
which yield 
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by elimination of 0, Hy’ and division by Ho. With the partitions 


i> 0 1, 0 C, Cs 
0, — ; b= ; c-| 
0 —l; 97 15 Cp, Cas 


(99) forces Ces = Cg, = 0 so that C reduces to a direct sum of C,, and Cg;. 
Consequently H is realized as Hp whose series ports are seen through a trans- 
former network of ratio matrix C,, and whose shunt ports are seen through a 
separate transformer network Cj. 

Consider now a lossless hybrid matrix with some partition into shunt and 
series ports; it is of the form 


H(p) -| 


Y(p) 24] (93) 


—N'(p  Z(p) 


where Y and Z are symmetric reactance matrices (hence odd) whereas N is 
even. If H(oo) is finite, (93) is of the form (up to terms in 1/p) 


Alp B 
dd el x] 


with 4 and C symmetric. The operation 


0 B 
mo-i- y 4 (94) 


represents the series-parallel extraction of an ideal-transformer network of 
ratio matrix B. All entries of (94) vanish at infinity. If (94) is not identically 
singular, its inverse must have a simple pole at infinity, which is only possible 
if A and C are non-singular. The principal part at infinity of the inverse is then 


TA y=? a | (95) 
WP. 0 pc 
and can be subtracted, corresponding to series-inductance (shunt-capacitance) 
extractions at the series (shunt) ports, yielding a hybrid matrix which is finite at 
infinity, on which the process (94) can be iterated. If (94) is identically singular, 
a preliminary transformer extraction is done separately at the series and shunt 
ports, and the extraction of (95) is performed on the inverse of the resulting 
matrix. 

In the case of a 2-port the basic step in the above iterative process is repre- 
sented in fig. 5 for the non-degenerate case, the permutation of ports I-II with 
respect to 1—2 resulting from the inversion of the hybrid matrix in (95). When 
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the process of fig. 5 is iterated on the successive internal 2-ports, one obtains 
a state-space structure of ratio matrix 


P, Lı Ll, Ls 


(96) 


where the part below the dotted line is the internal 2-port of fig. 5. The pattern 
of (96) is identical to (81) except for normalization and contains one redundant 
ratio because the terminating resistance is still 1 in fig. 5. 

Finally we show that ail component values in (96) can be obtained by iterative 
linear computations on the polynomials f, g, h of the scattering matrix with- 
out explicitly inverting any hybrid matrix. In fact, the extraction of a series 
inductance from the input of the 2-port (f, g, h) having at least one trans- 
mission zero at infinity (deg f< deg g) is the most elementary case of cas- 
cade decomposition 1°) and yields a residual 2-port of polynomials 


J’ =f, 
g —g-dciLp(h—g) (97) 
h' =h+4tLp(h—g), 
where 
+h 
Se" i (98) 
p(g—h)-o 


Similar formulas hold for the dual extraction of a shunt capacitance at port 2. 
It thus only remains to find analogous formulas for the hybrid extraction of 
an ideal transformer. 

If (89) is finite at infinity, its value is 


0 —n 
H(oo) = k | (99) 
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2f 


mctu (100) 
g-ge«-- h hie 


Identifying the difference between (89) and (94) to a hybrid matrix of the form 
(89) in terms of f’, g', ^', one obtains 


f -f-. X don (101) 


and three other relations which immediately give by sums and differences 


E Te HEF se, (102) 
h F hi =h F h, (103) 
gth =gth. (104) 
From (87) and the similar dashed formula one deduces 
Why! — hh. £ Cf! —f) C +f) = 8'8 — gg. (105) 


Replacing f' by (101) and using the identity 
X489 = (g + e? — (g F &)? 


and similar identities for hh, and the dashed products, one transforms (105), 
after cancellation of terms produced by (102) and (103) into 


(g +g) — (h + h) — 2n (g + g + h +h) [f —5(g 8 +h h)] 
= (g' +g yY —( hY. (106) 
The left-hand side is divisible by g + g« + h + h. and the right-hand side by 


g'g +h’ + h', equal to the first expression by (104) + (104)'. One thus 
obtains 


g +g hk Fh =gtg—hF ha —n [4f — n (8 + g& + h + h)]; 


hence, by linear combinations with (102)-(104), 


g =g- ne Egt hho), 
(107) 
n 
Eh Wong Egt kein] 


The final expressions are thus (101) and (107). 
For an antimetric 2-port (h. = + A), (107) yields A.’ = + k’, so that the 
residual 2-port remains antimetric. Since antimetry also guarantees dual ex- 
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tractions of a series inductance and shunt capacitance from opposite ports, the 
iterative synthesis based on the hybrid matrix yields an antimetric structure. 


MBLE Research Laboratory Brussels, September 1975 
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ON THE THEORY AND THE CALCULATION OF 
WORST-CASE EYE OPENINGS IN 
DATA-TRANSMISSION SYSTEMS 


by H. C. van den ELZEN 


Abstract 


For linear data-transmission systems, such as base band, amplitude 
modulation with suppressed carrier (AMSO), vestigial sideband (VSB), 
single sideband (SSB) and quadrature-amplitude modulation (QAM), 
it is shown that the impulse response is constituted of two basic func- 
tions, the p(t) and g(t) functions. Except for base-band systems, the phase 
angle between the carrier wave at the transmitter and the carrier wave 
at the receiver determines how the p(t) and q(r) functions should be com- 
bined to obtain the impulse response. The so-called signal-space diagram 
can also be constructed with the p(t) and g(r) functions. With the impulse 
response it becomes possible to construct the eye pattern of the system 
and consequently the worst-case eye opening, which are criteria by which 
data-transmission systems can be judged. The definition of the worst- 
case eye opening is given for two-level systems and extended to multi- 
level systems, coded systems and orthogonal (QAM) systems with inde- 
pendent or dependent channels. Due to the infinite summation in the 
definition of the worst-case eye opening, its actual calculation can only 
be an approximation. The approximation error is discussed at some 
length with the aid of some examples. The instant of time at which the 
worst-case eye opening is a maximum is considered as well as the nor- 
malisation constant. Also the probability of error for data-transmission 
systems is briefly discussed. 


1. Introduction 


A frequently used method to judge data-transmission systems is the study of 
eye patterns and worst-case eye openings. The eye pattern can easily be dis- 
played on an oscilloscope by using a horizontal sweep, the repetition frequency 
of which is the inverse of the bit length, and applying the data signal to be 
considered to the vertical amplifier. The worst-case eye opening, which is de- 
fined as the worst-case inner envelope of the eye pattern, cannot be displayed 
so easily. For theoretical investigations of data-transmission systems, however, 
the calculation of worst-case eye openings has proved to be very useful. In this 
paper the theoretical fundamentals of these calculations will be treated for 
linear data-transmission systems. By linear we mean that the superposition 
tules apply, so that the complete system can be characterized by its impulse 
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response (or two impulse responses in orthogonal systems). For baseband data 
transmission the impulse response is simply the inverse Fourier transform of 
the product of the transfer functions of the transmitting filter, the channel and 
the receiving filter. For linear modulation-demodulation systems the method 
of finding the impulse response becomes more complicated, due to the in-phase 
component and the quadrature component. Moreover, the impulse response 
depends on the phase difference between the carrier in the transmitter and in 
the receiver. In sec. 2 we therefore give an analysis of these systems. It turns 
out that two basic functions can be defined, the p(t) and q(¢) functions, valid 
for any type of modulation to be considered, such as AMSC, VSB, SSB or 
QAM. The functions p(t) and q(t) depend only on the various transfer func- 
tions. After specification of the type of modulation-demodulation, the func- 
tions p(t) and q(t) have to be combined in some particular way to obtain the 
impulse response. The p(t) and q(t) functions can also be used to construct the 
so-called signal-space diagram of a modulated system. Because this signal-space 
diagram plays a role in the determination of worst-case eye openings for orthog- 
onal (QAM) systems, we treat them also in sec. 2. For the analysis of linear 
systems it is sometimes easier to look at the time-domain representation and 
sometimes easier to look at the frequency-domain representation. For clarity 
and for completeness, both representations are treated in sec. 2. 

Section 3 starts with a mathematical description of the eye pattern and of 
the worst-case eye opening for two-level systems. In the subsequent sub-sections 
the definition of worst-case eye openings is extended to multilevel systems, for 
coded systems and for orthogonal (QAM) systems with dependent or independ- 
ent channels. After having defined the worst-case eye openings for various 
circumstances, sec. 4 describes methods of calculating eye openings. It is shown 
that the discrete Fourier transform (DFT) is a convenient tool to approximate 
the p(t) and q(t) functions from which impulse responses are obtained. The 
relation between the Nyquist frequency, the bit length and the period of the 
DFT is considered. Examples are given showing how well worst-case eye open- 
ings are approximated. The instant of time at which the worst-case eye opening 
becomes a maximum is discussed. The normalization constant is also discussed. 
Finally in sec. 5 the probability of error of data-transmission systems is briefly 
discussed for two-level systems only. 


2. Mathematical description of linear data-transmission systems 


2.1. Representation in the time domain 


2.1.1. Systems with one single carrier 


Figure 1 shows a general block diagram of a linear modulation-demodulation 
system. The input signal is denoted s,(¢). The impulse response of the low-pass 
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noise 
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cos cot hlt) cos fost- pep c 


Fig. 1. Block diagram of a linearly modulated system. 


filter LP, is ^, (f). Then the output signal of the low-pass filter becomes the time 
convolution of the input signal and the impulse response. Thus we have 


So(t) = si(t) * hQ). (1) 


A signal s,(¢) now produces a linearly modulated wave when multiplied by - 
a sinusoidal carrier 


53(t) = s,(t) cos wet. (2) 


It is assumed that the carrier frequency f, is larger than the bandwidth B of 
the low-pass filter LP,. This restriction is introduced in order to prevent the 
occurrence of the so-called fold-over distortion +). (From ref. 1 we know that 
in some cases it is possible to correct for the fold-over distortion in an elegant 
way.) 

The modulated wave of (2) can be transmitted across the transmission chan- 
nel, which in general has a bandpass character. The additive noise on the chan- 
nel will be neglected in the present description. In most cases a transmitting 
and a receiving filter are used with some specified bandpass characteristics. In 
fig. 1 the transmitting and receiving filters are put together with the transmission 
channel to form one bandpass characteristic BP, having an impulse response 
h,(t). The signal at the input of the receiver, which will be called the /ine signal 
then becomes 


Sa(t) = [s2(t) cos wt] * h(t). (3) 


Writing out the convolution as an integral we have 


54(t) = f st — 1) cos cx (t — T) h;() dv. (4) 
This can be re-written: 
S4(t) = P(t) cos ot + Q(t) sin ct (5) 


where 
P(t) = f st — T) cos oz h,(t) dt = s;(t) * [h,(t) cos ct] - (6) 


and 
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oo 


Q(t)= f s;(t— 2) sin ocv h;(z) dv = s,(t) * [h;(£) sin ot]. (7) 


-0 


The functions P(t) and Q(t) in general are called the in-phase and the quadrature 
components of the modulated signal 7-3). The further description of linear 
modulation-demodulation systems will be given in terms of these P(t) and 
Q(t) functions. 

Referring again to fig. "1 we see that demodulation of the line signal zu 
takes place with a sinusoidal carrier f,. In practical systems there might be 
present a frequency off-set and/or a phase off-set (or jitter) for this carrier. 
We therefore introduce the phase function (t), so that after passing through 
the low-pass filter LP, the demodulated signal becomes 


Se(t) = {sa(t) cos [wet — p(t)]} * h(t). (8) 
Substituting (5) into (8) and using some trigonometric identities we obtain 
Selt) = X IPE) cos y(t) + Q(z) sin y(0)] * Ast) + 40; (9) 
where 
A(t) = 4 {P(t) cos [2w.t — v(t)] + Q(t) sin [20,t — p(t)]} * halt). (10) 


As h(t) is the impulse response of a low-pass filter the expression for A(¢) is 
zero. This will become clear from the frequency-domain description. 
2.1.2. Systems with two carriers in quadrature 


For a system with two carriers in quadrature (also called an orthogonal 
system), as depicted in fig. 2, the various signals can be described in the same 
fashion as in the preceding section. The line signal s,,(¢) now becomes 

Sao(t) = P,(t) cos wt + Q(t) sin ct + P(t) sin wet — Q,(t) cos wet, (11) 


where the functions P,(t), P,(t), Q(t) and Q,(t) have been defined in expres- 
sions (6) and (7). The index c stands for “cosine branch” and the index s stands 


sin fcx t- yt) 
s,. (t) s S. (0 


Do A 


Fig. 2. Block diagram of a linearly modulated orthogonal system. 
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for "sine branch" of the system. As can be easily verified, the output signals 
Sec(t) and sg,(£) become 


Sec(t) = 4 {[Pe(t) — Q,(t)] cos p(t) + [Q.(7) + P,(t)] sin v(2)) * halt) + Alt), 
(12) 


Ses(t) = $ (LP. + Q.(¢)] cos p(t) + LOE) — P.()] sin y(t)} * halt) + 4C). 
(13) 


The components 4,(t) and A,(t) are similar to A(t) and can be considered to 
be zero. 

It is clear from (12) that information from the one branch is present in the 
other and vice versa. Hence, in general, both branches are distorted by wrong 
information. This is called cross-talk distortion. It is emphasized, that this cross- 
talk distortion is not only due to the quadrature components Q(t) and Q,(t), 
but also, because of the phase angle y(t), to the in-phase components P,(t) and 
P,(t). Under which circumstances the cross-talk vanishes, will become clear in 
sec. 2.4.1. 


2.2. Representation in the frequency domain 


2.2.1. Systems with one single carrier 


In sec. 2.1 we derived expressions for the so-called in-phase component P(t) 
and the quadrature component Q(t). Making use of the Fourier transform and 
its inverse, we rewrite (6) and (7) as follows: 


P(t) — F-(F(s,(t) * [h,(¢) cos oxt])), (14) 
Q(t) = F-(Físs(t) * [42(t) sin w,t]}). (15) 
Writing out the Fourier transforms, we obtain 
P(t) = F-* (3 S,(@) [H,( + o) + H,(w— @,)]}, (16) 
Olt) =F- {Fj S2(@) [H,(@ + o.) — H,(w— w,)]}, (17) 


where S,(w) is the Fourier transform of s(t) and H,(w) is the Fourier trans- 
form of h,(t). As has been mentioned above, the bandwidth of the low-pass 
filter LP, is equal to B where B < f.. This means that the spectrum S,(w) of 
the signal s,(t) is band-limited to the interval [—B, B]. Figure 3 shows the 
spectrum S (w) schematically. f 

The transfer function H,(w) is also depicted, as well as its shifted versions, 
so that the frequency-domain representation of P(t) and Q(t) is easily obtained. 
The frequency-domain representation of the line signal 54(t) can be obtained 
by the Fourier transform of (5): 
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Fig. 3. Frequency representation of some transfer functions and shape factors, occurring in 
linearly modulated systems. 


1 
S,(@) = = [F{P(t)} * F(cos wt} + F( Q(r)) + F(sin w,t}]. (18) 


Using (16) and (17) in (18) and carrying out the frequency convolutions yields 
S,(w) = 4 [S2(@ — o9) + S2(@ + @,)] Hoo). (19) 


The spectrum S,(w) is depicted in fig. 3. Of course this result is the same as 
that found from a direct frequency-domain inspection of fig. 1. 

For the demodulated output signal s,(t) the frequency-domain representa- 
tion is obtained by direct inspection of fig. 1 or by taking the Fourier trans- 
form of (8): 


So(w) = (Sa(@) * F{cos [wt — v(t)))) Ha). (20) 


Using expression (18) for S4(w) and substituting into (20), with some algebra 
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Fig. 4. Frequency representation of some transfer functions and shape factors, occurring in 
linearly modulated systems. 


one easily arrives at expression (21), which is the frequency-domain equivalent 
of (9) and (10): 


Se(w) = 4 F{P(t) cos y(t) + Q(t) sin y(t) } H3(w) + 

+ 4F(P(0 cos [2w.t — y(t)] + QC) sin [2w.t — p(t)]} Hao). Q1) 
The various terms occurring in (21) are depicted schematically in fig. 4 (for 
p(t) = 0). The low-pass filter LP, is considered to be a filter which suppresses 
the higher components concentrated around 2w.. The term A(t) in (9) and (10) 
has therefore been discarded. Similarly the second term in (21) can be dis- 
carded. Hence, the frequency-domain representation of the output signal be- 
comes 


Selo) = 3 F(P(1) cos p(t) + QC) sin y(t)} Hs(o). Q2) 


2.22. Systems with two carriers in quadrature 


For orthogonal systems similar considerations apply as for systems with one 
single carrier. The line signal s,,(¢) of expression (11) can be represented by 


1 
S4o(@) = oa [F{P.(t)} * F(cos wet} + F{O,(t)} * F(sin et) + 


+ F{P,(t)} * F(sin wt} —F{Q,(t)} * F(cos ot], (23) 


where the Fourier transforms of P.(t), Q,(t), P(t) and Q,(t) can be written 
as (16) and (17), but with S,,(@) or S,,(w) instead of S,(w). Writing out (23), 
we get 
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S4o(@) = i [S; «(o T We) + S2.(@—@¢) Tj Sz% + we) —j So T w.)] Ho). 
(24) 
The result of (24) can also be obtained by direct frequency-domain inspection 


of fig. 2. The final output signals after demodulation and filtering can be derived 
in a similar way to systems with one single carrier. The results are 


See(w) = $F{[P.(t) — Q.)] cos p(t) + [Q-(¢) + Ps(t)] sin y(t)} Ho), — 25) 
Ses(w) = $F{IP(1) + Q-(4)] cos p(t) + IO) —Pe(t)] sin p(t)} Hs(e). (26) 


2.3. Impulse responses 


With linearly modulated systems it suffices to describe them by their impulse 
response. Once this impulse response has been found, the system can be looked 
upon further as a base-band system. From the previous sections it is quite easy 
to find this impulse response by simply taking the Dirac delta function as the 
input signal. 

From (1) and (6) and (7) we then have 


P(t) = hy(t) * [hy(£) cos wet], (27) 
Q'(t) = h(t) * [h;(t) sin ot], (28) 


and from (9) and (22) the output signal sg'(t), which is equal to the impulse 
response A(t), we then have 


56’ (t) = h(t) = 3 [P'(t) cos y(t) + Q'(r) sin p(t)] * As). (29) 


The “base-band equivalent” of the linear modulation system is depicted in 
fig. 5. The “base-band equivalent” for the orthogonal system can be found in 
a similar way. It is depicted in fig. 6. The expressions for the direct and cross- 
talk impulse responses are found to be 


hy s(t) = hz2(t) = $ [P'(t) cos yt) + Q'(r) sin y(r)] *43@), GO) 
hy2(t) = —h21(t) = 4 [Q'(t) cos p(t) — P'(t) sin y(¢)] * hale). (31) 


As mentioned in sec. 2.2.1 the low-pass filter LP, is considered to suppress 
components around 2,. In most cases the filter is constructed such that it 
does not affect the relevant low-frequency components. This means that A(t) 
can be deleted in expressions (29), (30) and (31). In our further description we 


s(t) nce ss) 


Fig. 5. Block diagram of the base-band equivalent of a linearly modulated system. 
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Fig. 6. Block diagram of the base-band equivalent of a linearly modulated orthogonal system. 


do indeed delete A(t). From (16) and (17) we know that the functions P'(£) 
and Q'(t) can be expressed as inverse Fourier integrals: 


oo 


11 
Pt) = 55 S H,(@) LH(o + w.) + H,(w—o,)] exp Got) de, — (32) 
7 2 w 


1 j 


9 = 2z 2 


S Hi(9) [H,(@ + @,)— H,(@—,)] exp (jot)dw. (33) 


The impulse responses /,(t) and h,(t) are real time functions, so that 
Hi,4(—0) = Hy, 2*(a). (34) 


If we further write the frequency function (c) H,(w + œ.) in terms of its 
amplitude and its phase function 


H,(@) H,(@ + w.) = Alw + w.) exp [—j (c + e], (35) 
then the expressions (32) and (33) for the in-phase and quadrature components 


can be transformed into 


P(t) = x Tán + we) cos [ot — (w + w,)] do, (36) 


= 00. 


Q') = ~~ J A + we) sin [ot — (ó + @,)] do. (37) 


-00 


Once having calculated the P'(f) and Q’(t) functions, the various impulse 
responses given in (29), (30) and (31) can be found easily. 


2.4. Specification of linear systems under ideal circumstances 


2.4.1. Application of Nyquist's first criterion soy 


The impulse response A, (t) of a base-band data-transmission system with a 
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transmission characteristic 


T(w) = Arw) exp [— $1(v)]. (38) 
is given by the inverse Fourier transform 


oo 


1 
h(t) = vU S Arlo) exp [—j $:(0)] exp (jot) do. (39) 


— 00 


In order to have an impulse response, which is a real function of time, the 
amplitude function A,(w) should have even symmetry and the phase function 
¢,(w) should have odd symmetry with respect to œ = 0. Therefore the im- 
pulse response A(t) can also be written 


h(t) = ns f Ao) cos [wt — $,(w)] do. (40) 


-0 


This last expression is exactly the expression which was found for the in-phase 
component of a linear modulation system. In (36), however, the amplitude 
function A(w + w,) is not necessarily even and the phase function $(o + we) 
is not necessarily odd with respect to œ = 0, as schematically depicted in fig. 7. 
For the description of linear modulation systems under ideal circumstances we 
start with a linear phase characteristic plus a constant: 


$(o + w) = o, + C. (41) 


This characteristic clearly is not odd with respect to œ = 0. Substitution of 
(41) into (36) and (37) yields 


| $9 ‘ 
P'(t) = Pm f Alw + w.) cos [w(t — te) — C] do, (42) 
-0 
base band: A, (6), 5, (co) 
r4 —o-' 
E bandpass:A (9), (6) , 
te p E 


| Pit) 1A (+O) Ø (+ Ge) 
i —-0 


" 
e 


Fig. 7. Frequency representation of a base-band shape factor, a bandpass shape factor and 
its shifted version. 
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1 eo 
Q'(t) = ST f Alw + w.) sin [w(t — t.) — C] do. (43) 
—o 

Using some trigonometric identities the impulse response of the “base-band 
equivalent" then can be obtained from (29) 


ao 


1 
sium J A@ + w.) cos [w(t — T.) — C + y(t)] do. (44) 


—00 


If the phase angle y(t) is chosen to be equal to C, then (44) becomes 


oo 


h(t) — iz f Alw + o) cos w(t — Te) do. (45) 
m 


eo 


It follows from (45) that the impulse response A(t) now consists of only the 
in-phase component. The quadrature component has disappeared by the choice 
of y(t). Further inspection of (45) shows that the linear phase characteristic 
merely delays the impulse response by an amount 1,. Because such a delay 
does not affect any further description, from now on we delete this delay, so that 
the impulse response becomes 


eo 


h(t) = — f Alw + w.) cos wt do. (46) 


-00 


The amplitude function A(w + w.) can always be written as the summation 
of its even and its odd part: 


Alw + we) = A(w + w.) + Alo + o). (47) 


Due to the cosine term in the integrand of (46), only the even part of A(w + w.) 
contributes to the integral. From the literature *:5:9) it is known that for data 
transmission with no intersymbol amplitude distortion at the sample instants 
KT, the shape factor A(w + w.) should fulfil the following constraint (the first 
Nyquist criterion): 


o 


2. 
ba aC + we + n=) = constant, (48) 


n=—0 


where T is the bit length (1/T bits per second). As will be discussed in the next 
section, with the above-mentioned constraint the various linear modulation 
systems can be defined. i 
For orthogonal systems similar discussions apply as for systems with one 
single carrier. It can easily be found that for a linear phase characteristic the 
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direct impulse response and the cross-talk impulse response become 


hat) = z fat + w.) cos w(t — t,) do, (49) 


-0 


1 oo 
hist) = pem f A{@ + o) sin w(t — T.) do. (50) 


-%0 


From the latter expression it becomes clear that cross-talk disappears when the 
odd part of A(w + w.) is equal to zero. For no intersymbol amplitude dis- 
tortion at the sample instants, the constraint given in (48) then also applies 
for orthogonal systems. 


2.4.2. AMSC, PSK-2, SSB, VSB and QAM systems 


In the preceding section we mentioned that for systems with a linear phase 
characteristic together with the amplitude constraint (48), it is possible to 
transmit data without intersymbol amplitude distortion at the sampling instants 
kT. This means that 


] œ 
— j de c) dw; k — 0, 
A(kT) =} 4n atq dp (51) 


0 ; k #0. 


The condition that (48) may be fulfilled, in order to obtain (51), is that 

Aw + w.) has skew symmetry with respect to w = + z/T. This implies that 

the total bandwidth is restricted to the interval [—2x/T, 2x/T] (for a non- 

linear phase characteristic, etc., other methods are possible 5). An example 
for Aew + w.) is depicted in fig. 8a. For various modulation systems 

Alw + w.) now has to be defined such that the even part looks like fig. 8a. 

(a) Taking A(w + w.) = A,(w + we) leads to the AMSC or PSK-2 system 
(fig. 8a). 

(b) Taking A(w + w.) = Aew + w.) for e >0 and A(w + w.) =0 for 
c < 0 leads to the SSB system (fig. 8b). 

(c) If a skew symmetry with respect to w = 0 is introduced in the SSB system, 
we arrive at the VSB system of fig. 8c. Only skew symmetry about 
w = 0 is required; the shape itself does not have any influence on the 
impulse response (in the ideal case). In most cases the shape is chosen to 
be the same as the shape around œw = z/T. 

(d) For the orthogonal system the same characteristics can be taken as for 
the AMSC system, provided that the odd part of A(w + w.) is equal to 
zero (no cross-talk). These systems are often called QAM systems (fig. 8a). 

From fig. 8 it is evident that SSB and VSB systems occupy less bandwidth than 
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Fig. 8. Frequency representation of some shape factors, we, shifted towards w = 0. 


AMSC systems for the same bit rate 1/T. In practice usually some limited band- 
width is available. We therefore want to describe the SSB and VSB systems in 
such a way that they occupy about the same bandwidth as the AMSC system. 
In fig. 8d the bandwidth of the SSB and VSB system has been doubled, giving 
rise to a bit rate of 2/T — 1/v. The corresponding sampled impulse response 
then becomes |: 

1 


AU); d m0, 
W y) = | dee 3, A F 09 do 


(52) 
0 ; k#0, 


where 7/2 = v denotes the new bit length. 

As is clear the QAM system occupies the same bandwidth as the AMSC 
system, but here the bit rate is also twice the bit rate of AMSC. Expression 
(51) applies for QAM for both the “cosine branch" and the “sine branch". 
The bit length in each branch equals 7. The so-called baud rate is 1/T and the 
bit rate is 2/T. 


2.4.3. Theoretical equivalence of VSB and QAM 


In the previous sections we have seen that a linear phase characteristic merely 
gives rise to a delayed version of the impulse response. Deletion of the phase 
characteristic in the expressions for the in-phase and quadrature components 
(36) and (37) simplifies these expressions: 
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P'(t) — x T Ad + @,) cos wt deo, (53) 
Q'(t) = - TAG + w,) sin wt do. (54) 


Now we consider the VSB system of fig. 9a. The shape of both edges has been 
chosen to be similar, so that the function A(w) is symmetrical about c. The 
difference between w, and wọ will be called wy. As follows from fig. 8d we 
then have 


=—, (55) 


MIA 
Oy = — 
NT 2 


where t = 7/2 is the bit length of the VSB system. When we use (55) in (5) 
then the expression for the line signal becomes 


54(t) = P(t) cos (w — wy)t + Q(t) sin (wo — wy), (56) 

where 
P(t) 2 > a, P'(t — nt), (57) 
Q() - Y, a, Q't—n9. (58) 


The a, in the last two expressions contain the information, as can be verified 
from the discussion on impulse responses. Instead of demodulating this line 


E t 
Oc |@o —-0) 


a) gy" 
A+) 
1 
aa) 
Ww b) —*0 
A (+00) 
| Gy 7 —»-0' 
c) 


Fig. 9. Frequency representation of (a) a bandpass shape factor; (b) its shifted version, 
w, shifted towards w = 0; (c) its shifted version, wọ shifted towards w = 0. 
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signal with cos w,¢ (as in fig. 9b) we rather demodulate (as if it were an orthog- 
onal system) with cos cf and sin wot: 


Sect) = (LP(t) cos (wo — wy)t + Q(z) sin (wo — wy)t] cos Wot} * ha(t), (59) 
Ses(t) = ([P(t) cos (wo — @y)t + Q(t) sin (wo — wy)t] sin wot} #h3(t). — (60) 


Using some trigonometric identities and recalling that 43(¢) merely suppresses 
the higher-frequency components, (59) and (60) become 


Sec(t) = 4 [P(t) cos wyt — Q(t) sin cxt], (61) 
Ses(t) = 4 [P(t) sin wyt + Q(t) cos wyt]. (62) 


Combining (53), (54), (57) and (58) with (61) and (62), finally yields (see also 
fig. 9c) 


1p 
Sg, (t) = — | a, A(@ + wo) cos w(t — nt) coe = n) dw, (63) 
4x - ^ 2 


nz-—o 


1 o 
Ses(t) = — f ) a, Aw + wo) cos w(t — nt) sin (= n) dw. (64) 
47 i 2 


Denoting 


o 


1 
p(t) = pur f Aw + wo) cos wt do, (65) 


= 0 


the expressions (63) and (64) become 


Sect) = Y E a, p(t — nt) cos (5 n} (66) 
T 
Set) = 4 P3 a, p(t — nT) sin (= n) (67) 


The function A,(w + wo) meets the first Nyquist constraint, so that 


1 oo 
rM [in J A(w+@)dw; k=0, (63) 


0 E k 5 0.. 
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From (66) and (67) we then observe that the even bits of the VSB signal appear 
at the “cosine” output and the odd bits appear at the "sine" output of the 
orthogonal demodulation, without any disturbance. It should be remarked, 
however, that some precoding is required: 


Sg (kt) = $ a2, PO) (DF, (69) 
Ses[QKk + 1) v] = $ azx+1 PO) (DF. (70) 
It is also possible to start with an orthogonal line signal (with a time lag v 


between the “cosine” branch and the “sine” branch) and demodulate this as 
if it were a VSB signal, but this will not be carried out here. 


2.5. Tabulation of impulse responses 


In this section the in-phase and quadrature components are described with 
respect to the “central” radian frequency wọ rather than with respect to the 
carrier We 

From the previous theory it is clear that for AMSC and QAM systems we 
and wọ are exactly equal. For a base-band system we have w, = wọ = 0. 
For the VSB and SSB systems, however, we have «c, = wọ F Wy, where 
Wy = 7/27 as in the preceding section. The upper sign (—) means upper side- 
band; the lower sign (+) means lower sideband. 

Defining 


1 o 
p(t) = PEN S A@ + wo) cos [wt — p(w + wo)] dw (71) 


and 


" q(t) = — = f Ao + wo) sin [wt — $(c + wo)] da, (72) 


-%0 


we find for base-band, AMSC and QAM systems that P'(f) = p(t) and 
Q'(t) = q(t). For the VSB and SSB systems we find from (36) and (37) after 
substitution of w, = wo F wy: 


P'(t)ssp = p(t) cos wyt + g(t) sin wyt, (73) 
Q'(t)sss = q(t) cos wynt F p(t) sin wyt. (74) 
The impulse responses of the various systems are collected in table I, expressed 


in the basic functions p(t) and q(t). 


2.6. Signal-space diagrams 


When the line signal of a modulation system is described as 


A(t) cos [ot + 4(t)], 


system 


base band 


AMSC, 
PSK-2 


VSB, SSB 


QAM 


TABLE I 


Impulse responses 


impulse responses with respect to wo 


P(t) 
4 [p(t) cos p(t) + g(t) sin p(t)] 


Nyquist bandwidth 
[lwo — z/T, wo + zx[T] 


bit length 


bit rate 


4 [p (t)cos wyt + g(t) sin wyt] cos p(t) 
+4 [a(t) cos wyt F p(t) sin wyt] sin y(t) 


4 [p(t) cos y(t) + q(t) sin p(t)] 3 [9(t) cos p(t) — p(t) sin p(t)] 


1/T 
1/T 
t = TJ2 J2JT 
2 times T 2/T 


SWHLSAS NOISSINSNVUL-VLVC NI SONINHdO HAT ASVO-LSUOM 


Iob 
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where wọ is the “centre” of the pulse spectrum, the so-called signal-space 
diagram 7) is nothing but a polar plot of A(t) and ¢(t). This polar plot, or “the 
path of the signal vector with respect to wot”, is a useful aid to illustrate 
similarities and differences among modulation systems. Further, in the case 
of channel distortions, noise, jitter, etc., the signal-space diagram certainly 
contributes to a better insight into the behaviour of various systems. Instead 
of the amplitude function A(t) and the phase function ¢(¢) its X and Y com- 
ponents can also be used, for 


A(t) cos [wot + ¢(t)] = X(t) cos wot + Y(t) sin wot. (5a) 


This expression is thus exactly equal to the expression (5) for the line signal. 
For AMSC and QAM systems c, is indeed the centre of the pulse spectrum. 
Their signal-space diagrams therefore follow directly from (5) and (11): 


AMSC: 
S4(f) = P(t) cos Wot + Q(t) sin wot, (75) 
QAM: 
Sao(t) = [Pc(t) — Q.(t)] cos wot + IP:t) + Q-(t)] sin wot. (76) 


For VSB and SSB systems w, is not the centre of the pulse spectrum, but, as 
follows from the preceding section: we = Wo F Wy. The (—) sign stands for 
systems where the upper sideband is used and the (-+) sign for the lower side- 
band. From (5) then the signal-space diagram becomes 


S4(t) = [P(t) cos wyt F Q(t) sin wyt] cos wot + 
+ [+ P(t) sin wyt + Q(t) cos wyt] sin wot. (77) 
In order to specify the signal-space diagrams, we have to introduce a de- 
scription of the input signal s,(¢). For digital data signals we have 


0 


s(t) = }, a ô(t—nT), (78) 


n--—0 


2 
where a, = 1, l = —1 (=) 1; 
L—1 
T = bit length (or 7/2 = t for VSB); 
L = number of levels. 


The in-phase and quadrature components then can be written: 


P(t) = y a, P'(t — nT), (79) 
Qt)= > aQ(—nT) . (80) 


n=—@ 


base band 


AMSC, PSK-2 


VSB, SSB 


QAM 


TABLE II 


Signal-space diagrams 


Pa An ptt mi nT) 
Ya pt—nT) 
>. a, [p(t — nt) cos x n[2 F q(t E nt) sin z n[2] 


2, lan p(t — nT) — b, «(t — nT)] 


X a q(t — nT) 


È an [4 p(t — nz) sin z n[2 + q(t — nz) cos x n[2] 


b» [bn p(t — nT) + An q(t —nT)] 
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b) 
C) 
l 
d) 


Fig. 10. (a) Raised cosine shape factor for a VSB lower-sideband system with (b) its two- 
level signal-space diagram and (c) its two-level sampled signal-space diagram; (d) its four- 
level signal-space diagram. 
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Substitution of (79) and (80) into (75), (76) and (77), while using the p(t) and 
q(t) functions (71) and (72), leads to the final expressions for the signal-space 
diagrams given in table II. 

As an illustration fig. 10b shows the signal-space diagram for a VSB system, 
using the upper sideband, with a shape factor as depicted in fig. 10a. Compari- 
son of this signal-space diagram with that of an FSK. signal with modulation 
index m = 0:5, as given in ref. 9, reveals that the line signals ofa VSB system 
and of an FSK (m — 0:5) system are exactly equal. If the signal-space dia- 
gram is sampled at the instants kt, the resulting diagram looks like fig. 10c. 
From this sampled signal-space diagram we observe similarities with orthogonal 
systems, as described in sec. 2.4.3. For multilevel systems the signal-space dia- 
grams become more complicated very rapidly. Figure 10d shows the signal- 
space diagram for a four-level VSB system with the shape factor of fig. 10a. 


3. Eye openings for linear data-transmission systems 
3.1. Systems with only a direct impulse response 


3.1.1. Definition of the eye pattern and of the worst-case eye open- 
ing for binary signals 
From the previous section it has become clear that linear modulation systems 
are characterized by the impulse response A(t) of their *base-band" equivalent. 
The output signal s,(¢) can therefore be described as 


ie) 


Se(t) = 2; On h(t i nT), (81) 


n=— © 


where a, = / with / = —1 (—) 1, 
L—1 
T = bit length, 
L = number of levels. 
For the regeneration of the information sequence {a,}, the signal s,(t) is in 
general sampled at the instants t = fọ + kT, where —7/2 < t, < T/2 and k 
is an integer ranging from —oo to oo. Hence from expression (81) we obtain: 


S6(to + KT) = ay h(to) + y a, (t + kT — nT). (82) 


nák 
For the ideal case, where the impulse response A(t) fulfils the first Nyquist 
criterion, we observe that for t — 0 eq. (82) reduces to the simple form 
Se(kT) = a, h(0). This means that the kth information digit a, has been 
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recovered without any disturbance of its neighbours. If, however, for some 
reason, the sampling instants are taken at eT + KT, then the summation term 
in (82) is no longer equal to zero. Thus interference of the various information 
digits takes place. This interference is called intersymbol interference. For 
impulse responses, which do not fulfil the first Nyquist criterion, the inter- 
symbol interference is always present, because there it is not possible to find 
any sampling instant where the summation term of (82) vanishes. Inherently 
related to the intersymbol interference are the so-called eye pattern and the 
worst-case eye opening, with which it is possible to judge the quality of the 
data-transmission system. They will be considered below. 

Referring to eqs (81) and (82) we observe that (82) is nothing but the func- 
tion of (81) on the kth segment of the time axis, defined in the interval 
(k —3) T « t « (k -- 3) T and shifted down along the t-axis over kT. Thus 
for each integer k an expression like (82) exists and hence we have an infinite 
set of functions, defined in the interval [— 7/2, 7/2]. With the introduction 
of a change of variables: m — n — k in (82), the set of functions can be de- 
scribed as 


{ep(to)} = (ax Alto) + Y, Aman Alto — mT)); (83) 
pos 
—o-«ck-«o, —T/2<t) <7/2. 


This expression gives the mathematical description of what is called the eye 
pattern of a data-transmission system. In practice (part of) this eye pattern 
can be displayed on an oscilloscope by using a horizontal sweep at a rate 1/T 
and applying the signal s,(t) to the vertical amplifier. If we restrict ourselves 
to two-level system, the a, can have either the value +1 or —1, according to 
(81). For a VSB system for example, using the upper sideband, with a shape 
factor as depicted in fig. 11a, the eye pattern becomes as plotted in fig. 11b 
for some specified sequence of a,. It will be clear that for each instant of time to 
there exists one sequence of a, for which the summation term of (83) becomes 
a maximum (in positive or negative sense). This maximum occurs when all inter- 
fering digits add together. Hence the maximum intersymbol interference is 
easily found by taking the absolute value of each of the terms of the summation 
in (83). We can thus describe the inner contour and the outer contour of the 
eye pattern (for positive values only) as follows: 


Cito) = (to) FY ans h(t — mT), (84) 


m-2-—90 


where the prime means deletion of m — 0. As, in (84), las] = 1, for binary 
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b) 
Exp (to) 
Ey (to) 
» io 
c) T 
Fig. 11. (a) Shape factor for a VSB lower-sideband system with (b) its corresponding eye 


pattern and worst-case eye opening; (c) the definition of the traces of the worst-case eye 
opening. 


data signals the summation becomes k-invariant and therefore the worst-case 
eye opening or eye opening is defined as 


oo 


E,i(to) = Alto) — Y! |h(t; — mr), (85) 


m= — oo 


for values of ty, where E, ,(t9) > 0. Figure 11c shows the function E» ito), 
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Fig. 12. Eye pattern and worst-case eye opening of the system of fig. 11a with some channel 
distortion. 


and also E; o(to) for which a (+) sign applies in (85). In fig. 115 the eye opening 

is depicted by means of the dashed curve. Figure 12 shows the eye pattern of the 

same VSB system as used for fig. 11, but now with a group-delay distortion. 

Again the eye opening is depicted by means of the dashed curve. From this 

figure some important observations can be made. 

(a) The time instant where the eye opening is maximum should be used as the 
sampling instant, because at that instant the margin against noise is as large 
as possible. 

(b) In this respect the “density” function of the intersymbol interference is 
important for the calculation of the bit error probability. 

(c) The maximum value of the eye opening does not in general coincide with 
the maximum value of the impulse response. 

(d) Variation of the sampling instant from its optimum value gives a degrada- 
tion of the eye opening. The eye opening is thus a measure for the sensitivity 
of the system with respect to timing errors. 


3.1.2. Multilevel worst-case eye openings 


After having defined the worst-case eye opening for a two-level data signal 
it is quite easy to arrive at a general expression for multilevel eye openings, 
including the two-level case. Figure 13 shows schematically two multilevel eye 
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— >t, —>rt, 
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Fig. 13. Definition of the traces of multilevel worst-case eye openings. 


openings: (a) for an even number of levels and (b) for an odd number of levels. 
The various traces of the eye openings are indicated by Ej , where L?stands 
for the number of levels and v ranges from 0 to L — 1. Using (81) for the 
definition of the levels and using (85) for the definition of eye openings we 
can write: 


Ex, o(to) = (to) + Kto), 
E,,1(to) = (to) — Kto), 


E, to) =o Hy + Kto), (86) 
Ez, ato) = —— Ms) — Kto), 
Fest) = Fate) + Mo) 

where ns = P |A(tp — mT)| and L z 2. 


Closer examination of (86) reveals that subsequent traces can be expressed in 
each other. Doing this we find the following recursive equation (which is well 
suited for computational purposes): 


L—» 


E, (t) = 2 
Li) = 2 — 


(to) — Ex,v-1(¢o)s (87) 


where v = 1 (1) L— 1 and Ez,o as in (86). 
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` By successive substitution, eq. (87) can be written in another way: 
yi 
med h(t)), y (,—i); »v-1()DL—1. 
ee (88) 
The traces E; , are defined for positive values only and then for those values 
of tọ where Ej ;, S Er,21-1; i= 1, 2, 3, ..., as follows from fig. 13. 


Ex, (to) = C1)” Er, o(to) + 


3.1.3. Worst-case eye openings for coded binary data signals 


The definition of the eye opening in the preceding sections is based upon the 
fact that each digit is completely independent of the others. Thus for such data 
signals any possible sequence of digits is allowed. For coded data signals this 
is no longer true, because subsequent digits may depend on each other. In other 
words, coded data signals have some form of a correlation, depending upon the 
coding rules. It is evident then that these coding rules should be incorporated 
in the calculation of the eye opening. In principle each different code will give 
rise to a different eye opening and hence a general formula cannot be obtained. 
In this section, therefore, only a special class of coded data signals will be 
treated, known as duobinary !?) and bipolar !*) coded signals. These codes 
are called pseudo-ternary codes, because the binary input data signal is trans- 
formed into a three-level signal with some specific properties. We assume that 
the encoding takes place in front of the low-pass filter LP, of fig. 1. The prin- 
ciple of the two codes is that a delayed version of the input data signal is added 
to (duobinary) or subtracted from (bipolar) the original input data signal. The 
principle of the encoding is shown in figs 14a and 14b. If the encoded input 
signal is passed through the system of fig. 1, the new output data signal, which 
will be denoted by s,(t), can be described as 


foe} 


S(t) = Y a, {h(t — nT) + h[t — (n + N) T). (89) 


n=- 0 
where a, = +1, 
T = bit length, 
N = order of the code (delay = NT), 
A(t) = the impulse response of the system without coding. 
The impulse response of the system with coding included then becomes from 
(89): 


g(t) = A(t) + A(t — NT), (90) 
where the (+) sign stands for duobinary coding and 
the (—) sign for bipolar coding. 


Figure 14c shows sketches of (90) for N = 2. For easy detection of these pseudo- 
ternary data signals some additional pre-coding network should be incorporat- 
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Fig. 14. Bipolar and duobinary signals; (a) example; (b) implementation and (c) impulse 
response of systems with duobinary or bipolar encoding. 


ed. Because this pre-coding does not affect the eye opening it will be disregarded 
here and reference is made to the literature ?:19713), The output data signal 
$7(t) of (89) is a three-level signal, giving rise to a three-level eye opening. The 
three levels are determined by the digits ay and a_y. The outer level (positive) 
is obtained by taking ag = 1 and a_y = + 1, whereas the zero level is ob- 
tained by taking ag = 1 and a_y = F 1 (upper sign for duobinary and 
lower sign for bipolar coding). The contribution of the maximum intersymbol 
interference here is the summation of the absolute values of the shifted versions 
of the impulse response, except for a shift of 0 and —NT. The worst-case eye 
openings then become, using the same notation as in (86), 


E3,o(to) = t [g(to) + g(to + NT) + I(to)], 
E3,1(to) = 5 [g(to) + g(to + NT) — I(to)], (91) 
Es,2(to) = 4 Ileto) F gto + NT)| + Kt9)], 

where Kto) = 5 lg(to — mT)|; 


m=- 0 
. m#0,-N 
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the upper signs apply to duobinary coding, the lower signs to bipolar 
coding; the factor 3 normalises the outer level to 1. 


3.1.4. Coded multilevel data signals 


As stated already it is not possible to obtain general formulae for the eye 
openings of coded data signals. Therefore, in this section we confine ourselves 
to one example, i.e. the application of the bipolar technique to a four-level 
signal. Starting with a binary data signal, the four-level signal is obtained by 
assigning one out of four possible values to each combination of two bits 
(dibits). After bipolar encoding, according to fig. 14b, and after passing through 
a linear modulation system, the output data signal becomes 


s(t) = x b, g(t — nT), (92) 


n=—-@o 


where g(t) is as defined in (90), and 
b, T" +1, +4, —L —1. 


Figure 15a shows a block diagram of the encoding. The output signal of (92) 
is a seven-level signal, as may be verified from table II. 


F t 
: - pa = “i i 
g) 


Fig. 15. (a) Straightforward seven-level bipolar encoding block diagram and (b) seven-level 
bipolar Van Gerwen encoding block diagram. 
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TABLE III 


For the calculation of the eye openings a new phenomenon has arisen, due to 
the fact that several levels are not unique. This means that different input levels 
may give rise to output levels equal to each other; this effect is indicated in 
table III by the dashed lines. For the various traces of the eye openings this 
implies that maximum or minimum values have to be taken for the possible 
combinations. Doing this, the eye openings become: 


Es,o(to) = t [g(to) — (to + NT) + Kto)], . 

E;,1(to) = 5 [g(to) — (to + NT) — (to) 

Ey,2(to) = 4 [max4(to) + I(to)] 

E,,3(fo) = X [ming(to) — J(£9)], (92) 
E;,4(to) = 5 [maxg(to) + It9)], 

E,,5(to) = 4 [ming (to) — I(¢o)], 

E3,6(to) =4 Ileto) + 8o + NT)| + Kto)], 


oo 


where Kt) = > lecto = mT)|; 


ma— oO 
m#0,-N 


max4(fo) and min,(fo) stand for the maximum and minimum of 
{(o) + $8(to + NT), $8(to.)—48(to + NT), — 8lto)— glt + NT)); 
max4(£9) and min,(fo) stand for the maximum and minimum of 
{8(to.) —4.8(to + NT), telt) — elt + NT)}. 


For proper detection of the seven-level signal, as described here, a special en- 
coder is required in front of the binary to four-level conversion (fig. 15a). Such 
an encoder has been described by Lender 14). 


414 H. C. van den ELZEN 


An alternative way of obtaining a seven-level bipolar data signal has been 
described by Sluyter and Van Gerwen !5). Instead of starting with a four-level 
data signal, they use four binary paths (see fig. 15b) having a value +1 if the 
corresponding dibit is present and being a —1 if not, as indicated in table IV. 


Each path contains a pre-encoder, a bipolar encoder and multiplication factor, 
with values 0, 4, 2, 1. The path with the multiplication factor 0 can of course 
be deleted (fig. 15b). The outputs of the three paths are added together and 
after passing through a linear modulation system, the output data signal now 
becomes: 


S(t) — Y, (Gbin + $ bzn + bs) (t — nT), (93) 


na~% 


where g(t) is as defined in (90); 
bin = +1, —1 or 0, according to the following rule (see ref. 15): if 
bin = + 1 then bjn = 0; Lj = 1, 2, 3; ij. 
From (93) it follows that the various levels are unique (except for the zero 
level). For the eye openings this implies that expression (92) is simplified and 
we have 


E,o(to) = $ [g(to) — g(to + NT) + Kto), 
E3,4(to) = 4 [g(to) — g(to + NT) — Kto)], 
E4,X(to) = $ [ gto) — $ g(to + NT) + Ito), 
E7,3(t0) = + [$ gto) — 4 g(to + NT) — Kto)], (94) 
Ej, (to) = 3 [$ 8o) — $ g(to + NT) + Kt)], | 
Es, s(to) = $ B gto) — $ g(to + NT) — Kto)], 
Es,e(to) = 3 l|g(to) + (to + NT)| + to), 
where J(f9) is as in (92). 


As an example, fig. 16 shows that the actual difference of the eye openings, 
calculated with (92) and with (94), is small. 
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Fig. 16. Seven-level bipolar worst-case eye openings according to figs 15a and 15b. 


3.2. Systems with a direct and a cross-talk impulse response 


3.2.1. Systems with independent channels 


From sec. 2 we know that for orthogonal systems we have two basic impulse 
responses, the direct impulse response and the cross-talk impulse response 
(411(t) and A;,(t)). For the calculation of the eye openings of such systems 
this implies that in addition to the intersymbol interference of the direct im- 
pulse response itself, the interference due to the cross-talk impulse response 
should also be incorporated. If the two channels are independent, then the total 
interference becomes simply the addition of the intersymbol interference and 
the cross-talk interference. Thus 


L(to) = 23, [s (to = mT)| + Ih, (to s mT)] + |21(to)|- (95) 


mz-—eoo 


Substitution of (95) rather than I(t.) into (86) and changing /(t;) into A, (tg) 
gives the eye openings for orthogonal systems with independent channels. The 
independence can be seen from the sampled signal-space diagram, as shown in 
fig. 17 for an orthogonal four-level system. For each possible value in the 
`x channel, all possible values of the y channel can occur. s 


3.2.2. Systems with dependent channels 


For orthogonal systems having some dependence between the two channels, 
the calculation of eye openings differs from the calculation in the previous 
section. Because it is not possible to give general formulae, we again confine 
ourselves to two specific examples | in order to show how the “dependence 
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Fig. 18. Sampled signal-space diagrams of (a) eight-phase systems and of (b) four-phase/four- 
level systems. 


rules? are being used. The sampled signal-space diagrams are depicted in 
figs 18a and 18b (the signal represented in fig. 18a can also be looked upon as 
an eight-phase signal; the signal represented in fig. 18b can also be Jooked upon 
as a four-phase, four-level signal). For orthogonal demodulation it no longer 
holds true that for each value in the x channel all possible values in the y channel 
are admitted, but the dependence rules are as follows. For fig. 18a: 


96 
x= + (2—1) 2y= l1; i 
for fig. 18b: 
x=4+1 sy=+1, 
9 
ixi) arse Wad. d 


For the system of fig. 18a the total interference is not found by simply adding 
the contribution of the direct and the cross-talk impulse response, but a 
maximum has to be found from the two possible combinations. Thus we have 


(v — 1) [at — mT)| + [last — i (98) 


Traax(to) - 2; max lis (to — mT) + (y2 = 1) |h21(fo a mT)| 


m=- 0 
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With the aid of the dependence rules, the eye openings then become 


E, (to) = hy 1(to) + (2 EE 1) |ho1(to)| + Tnax(to)s 
E4,1(to) = hii(t9) — (V2 — 1) [hai (9) — Inax(to)» 


. 99) 
E4,2(to) = (V2 — 1) h11(to) + |h21(to)| + Inax(to)s 
E4,3(to) = (V2 — 1) hi 1(to) — |ho1(to)| — Inax(to). 
For the system of fig. 18b the total interference becomes 
I/(fg) = > [lA11(to —mT)| T lho 1(to —mT)|], (100) 
and, applying the dependence rules, ‘the eye openings become 
Es, o(to) = hii(to) + |h21(to)| + D (to), 
Es,1(to) = hii(to) — |%21(to)| — I, (to), (101) 


E4,2(to) = (y2— 1) Ayi(fo) + (/2— 1) |h21(to)| + T,'(to)s 
Es s(to) = (J2 — 1) ii (to) — (y2 — 1) 15:9) — K (to). 


It is clear that, for different dependence rules, different eye openings are ob- 
tained. In general, for finding the eye openings for orthogonal systems, the 
sampled signal-space diagram is of great importance, because the dependence 
rules can be easily derived. 


4. Considerations on the actual calculation of worst-case eye openings 


4.1. The discrete Fourier transform as a convenient tool 


In sec. 2 it was shown that the functions p(t) and q(t), as defined in (71) 
and (72), are the basic functions which characterize linear data-transmission 
systems. Once these functions are known, the impulse response and, if neces- 
sary, the signal-space diagram can be calculated. Consequently, from the im- 
pulse response the worst-case eye openings can be calculated. Clearly it is 
worthwhile to have an adequate method to calculate the p(t) and q(t) func- 
tions. The “base-band” functions A(w -+ wo) and 4(w + wo) in (71) and (72) 
can be split into their even and odd parts: 


Alw T wo) = Ae + Ag, 


(102) 
p(w + wo) = $e + Po 


so that for the p(t) and q(t) functions we arrive at 
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1 ro) 
P(t) Tn J [Ae cos $e cos à, — Ao sin be sin $o + 


-%0 


—j (A, cos ¢, sin ¢, + A, sin ¢, COS ¢,)] exp (jwt) dw, (103) 


1 o 
q(t) = D f [Ae sin e cos d, + A, cos $, sin $y + 


-0 


—j (A, sin he sin $o — A, COS $e COS $o) ] exp (jwt) do. (104) 


For ideal circumstances, such as Nyquist amplitude functions with linear phase 
functions, the above integrals can sometimes be calculated “by hand”. In 
general, however, when amplitude and/or phase distortions are present, it is not 
possible to obtain analytical expressions for the p(t) and q(t) functions. This 
means that numerical computations must be applied, such as for example the 
discrete Fourier transform (DFT), which has become a very commonly used 
tool nowadays. In practice this DFT can be performed in a very quick mode, 
called the fast Fourier transform or FFT !5). The DFT, when properly used, 
is a good approximation to the continuous Fourier transform. We consider 
below the theory of the DFT very briefly. The definition of the DFT in relation 
to the continuous Fourier transform is as follows: 

If f(t) F(w) forms a Fourier transform pair, where f(t) stands for a 

time function and F(w) stands for its related frequency function, then the 

corresponding DFT pair becomes 


T f, (kv) <> F (nwo), 


with f,(kt) and F,(nwo) as the sampled versions of the periodically repeated 
functions f (t) and F(w). The period of the time function is T, = Nv; the 
period of the frequency function is equal to Q, = Nwo. The "time resolu- 
tion" v and the “frequency resolution” wọ are related by wot = 2n/N. 
The actual transform pair can be written as !7:19:19) 


N-1 


F k ee 
p(NW) = 2. T f, (kx) exp (4 N r) 


2 (105) 


N-1 
1 2 
afk) = — ) F (nox) exp ( j Z k n) 
n=0 7 


Properties of the DFT pair can be found in the literature and will not be 
given here. 
For band-limited frequency functions, having an “equivalent” base band 
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with bandwidth B, the function F,(nw 9) is equal to F(w = nwo) if B < Nwo. 
The related impulse response f(t) is then generally an oscillating continuous 
function, ranging up to infinity, with a decreasing envelope. A periodic repeti- 
tion of such a function then inevitably leads to overlap of its shifted versions. 
This overlap is called “aliasing”. Due to the decreasing character of the en- 
velope of the time function, the aliasing error becomes small for a sufficiently 
large period Nr. Therefore 


fk) = y f (kc —INt) wf(t=kt); k=0(1),N—1. (106) 


I--—co 
In a later section more will be said about the actual choice of the period Nr. 


4.2. Relation between the Nyquist frequency fw, the bit length T and the period Nx 


As is known, for a base-band data-transmission system, having a Nyquist 
frequency fy, the related bit length is equal to T = 1/2 fy. In the discrete case 
with a frequency resolution fọ and a time resolution r, the Nyquist frequency 
can be written as fn = fy’ fo and the bit length can be written as T = T" t, 
where fy’ and 7" are integers indicating the number of samples in the Nyquist 
interval and the bit interval. The above-mentioned relation then becomes 


1 
T'y————. 
2 fw fo 
From the definition of the DFT we have c f; = 1/N, so that 
T' 2d (107) 
2f 


Because the total number of samples in one period of the DFT is equal to N, 
the total number of bits contained in one period is equal to N/T”, or with (107): 
2 fy’. In practice, the DFT is usually used in a rapid mode, the FFT radix 2. 
This means that N should be a power of 2; thus N — 2". With the above 
relationships we then arrive at table V. From the table it follows that, unless 
N becomes very large, a large number fy’ is related to a small number of samples 
per bit (poor time resolution) and a large number of bits in the interval Nr. 
On the other hand, a small number fy’ (poor frequency resolution) is related 
to a small number of bits in the interval Nv and a large number of samples 
per bit (good time resolution). 


4.3. Numerical approximation of the worst-case eye opening 


From the definition of the worst-case eye opening as given in (85) it follows 
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TABLE V 


number of bits 


that a summation up to infinity should be performed. In other words, this 
means that an infinite number of bits should be taken into account. Of course, 
for a numerical calculation, this is not possible and the actual calculation will 
be restricted to a number of 24 bits. Thus the approximated two-level eye 
opening becomes 


A D T 
EO =O Y pe-—nsr) |t| < 2 (108) 
n--A 
The error made by this approximation is 
e(t) = E(t) — E2,1(t), (109) 
or 
o -A-1 T 
et)= Y hest E esan js zo q0 
n=A+1 n=- 0 


This error will be studied more thoroughly in this section. We replace the 
impulse response /(¢) in (110) by its inverse Fourier transform 


1 © 
h(t — nT) = D f Hw) exp [jo (t — nT)] do. (111) 
Teo 
After some manipulation, the error signal in (110) then becomes 


Lom o , 
«Decr Y [| / Ko) exp [jo (t — 17)] do| + 


IT n=4+1 -o0 


+ | f. Bo) exp [jo (t + nT)] dol]. (112) 
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Integration of (112) by parts m times (originated by Kristiansson 20)) yields 


` 


ao 


1 > exp [jw (t — nT)] 
Ie 15m (m) 
M" D (| d j 08 any $e i 


n=A+1 
asm f eo EXP Lie t+) ) 
+k 1) Í Hlo) UTE dol). (113 


Manipulating the absolute values and normalising with respect to T we easily 
arrive at the following inequality: 


oo 


et f pre do > <= + v) (114) 
2z i (n — t)” (n + t)" 


n=A+1 


The “frequency” part in this expression is a constant C, depending on the 
actual shape factor under consideration. The “time” part of (114) can be 
worked out further by using the following identities and inequality: . 


1 oo 


MEE f xM-1 er xt) gy — —_ ; n-ct 0, (115) 
(m — 1)! 4 (n+ t)" 
~ e- (4* 0x 
Y em = =3  0«x«o, (116) 
n-A-1 1 =e 
l-e*pxre*;, O0<x< om. (117) 


After some algebra we then finally arrive at 


1 1 1 
t) € C—-| — + — }; |i] <4, m>1, (118 
ý a Gea KS m VD 


1 oo 
where C= 35 S |H*(o)| do. (119) 
To 


The upper bound of e(t) thus becomes the tighter the more times the integration 
by parts can take place. Integration by parts, however, means that the derivative 
of the shape factor has to be taken, and this is not possible for an unlimited 
number of times (after a certain number of derivatives delta functions appear 
and the differentiation cannot be continued). 
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As an example we take the very commonly used racos-« shape factor, 
symmetrical around w = 0 (see fig. 19): 
1 ; Oxlo| & (1— 0) z, 
, flel— 
H(o) = 1*| 1 — sin (=) ; Aan <lo «(0 +a), (120) 
a 
0 lel > (1 + a) z. 
Differentiating thrice with respect to œ yields 
Ll eos TT, 1a) a «le « o) 
- —— cos ——— ; — a) zx: « |o a) 7t, 
. 2 Qa? 2a bu 
sign (w). H@(w)= (121) 
[êlo — (1 — a) x) + à(e — (1 + a) 22]. 


2 (2a)? 


Substitution into (119) yields after performing the integral, 


1 


C= 122 
2z a? gen 


and the error becomes 


(G-a)  (t*o)?T —»o5 


H (a2) 


Fig. 19. Racos-« base-band shape factor and its first, second and third derivatives. 
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: l I : < 123 
S a aa aca} e| « 3. (123) 


For large values of A (with respect to 7) the error reduces to the very simple 
expression 


a(t)  ———-. (124) 


Figure 20 shows this upper bound (as a percentage with respect to the maximum 
eye opening, put equal to 1) versus the total number of interfering bits (24), for 
various values of the roll-off factor. As follows from fig. 20, the error increases 
for decreasing roll-off factor. Taking an error < 1% we observe that the 
number of interfering bits lies within the interval of 10 to 100. 

In the case of amplitude and/or phase distortion, it can be verified that the 
error becomes larger, depending on the amount of distortion 2°); in the 
author's experience, however, it suffices in general to take 50 to 100 bits into 
account to obtain good results. Only for systems with a roll-off factor of say 
a< 0:1 it will be necessary to take into account more interfering bits. The 
choice of the period Nr for the DFT remained as a question from sec. 4.1. 
With the foregoing theory now this becomes simple. We can state that N has 
to be such that at least 24 bits are contained within the period Nr. If, for 
example, we choose to take into account 64 interfering bits and if we want to 
have a time resolution of 32 samples per bit, then from table V we find a 
normalized Nyquist frequency fy’ = 32 and a total number of samples 
N = 21! = 2048. 


WN 
E (96) vac 
IJ à as 


N WN Ht 
A 


10? 
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Fig. 20. Graph of the truncation error for the two-level eye opening versus the number of 
bits taken into account for racos shape factors. 
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4.4. The maximum value of the worst-case eye opening 


As stated already, the margin against noise is as large as possible when the 
eye opening reaches its maximum value. Therefore in this section the maximum 
eye opening is considered in more detail. 

The impulse response h(t), from which the eye opening is calculated, con- 
sists of a combination of the basic functions p(t) and g(t) and of the phase 
angle y(t) of the demodulating carrier. In general this phase angle is com- 
posed of a constant yg and a time-varying function y,(t), preferably having 
a small amplitude, the so-called jitter of the carrier. In this section we neglect 
the jitter and we consider only the constant phase angle yo. The functions p(t) 
and q(t) are fully determined by the spectral characteristics of the modulation 
system (in terms of “equivalent base-band” amplitude and phase characteristics). 
For a given system the maximum eye opening therefore still depends on the 
phase angle yo and on the sampling instant fo. Thus expression (85) for the 
two-level eye opening can be modified as follows: 


E; i(fo, Vo) = Altos Yo) — P» |A(to — mT, Yo)|s (125) 


m--60 


for values of tọ such that E, (fo, vo) È 0. 
The optimum values £9,,,, and o,opt, for which the eye opening E;,; be- 
comes maximum, can normally be obtained by solving the equations 


OE, i(to, Vo) OE, (fo, Vo) 
—— = 0 and —— = 


0. 126 
Oo Oyo ce) 


Unfortunately it is impossible to obtain an analytical solution for (126), and 
therefore we have to look for other methods to arrive at the desired optimum, 
approximately. From practical experience (see e.g. fig. 12) and from the litera- 
ture 3-21) it is known that for moderate transmission distortion, the maxima 
of the impulse response and of the eye opening, though at different instants of 
time, are in the vicinity of tj = 0. Hence if an “optimum” for the impulse 
response has been found, we know that we are rather close in the vicinity of 
the maximum eye opening. Thus a first approximation to (126) is 


Oto, oA(to, 
(fo Y) o and (to Po) _ 


0. 127 
dto Oo Ge) 


To facilitate our further discussions, we use from now on the impulse response 
of the AMSC system *) (see table I): 


*) Other systems can be used just as well. 
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(to, Yo) = p(to) cos Yo + q(to) sin Yo. (128) 


Application of (127) and (128) yields 


dp(to) en dq(to) xis 0, 
prr Vo dt, Vo — 
. (129) 
—p(to) sin Yo + (to) cos yg = 0. 
From the second equation of (129) it follows that 
t 
yo = arctan alto) (130) 
P(to) 


This expression is valid for all tj and thus for the optimum value of f, too. 
Then, by substitution of (130) into the first equation of (129) we have 


= o) t 


P(to) + «(t en 


=0. (131) 


The functions p(t) and q(t) have been defined in expressions (71) and (72). 
Substituting them into (131) while taking sin x ~ x and cos x ~ 1 we obtain 


[Ado f w Alwto — $) dw + f Alwto — 4) do fw Ado — 0, (132) 


where A and ¢ are functions of w + wo, and where the limits of the integrals 
have been ignored. | 

From (132) we finally find an expression for the approximated "pan 
sampling instant fo: 


; | [Ado foAgddw + [A$do [v Ado jäs 
uin [ A do fw? Adw+ (f o A doy? em 


The “optimum” for yp now follows by substitution of t, into (130). 
For an amplitude characteristic which is even about the carrier frequency fy, 
the expression for t, simplifies considerably: 


| [9e Aédo 
A= heads” (134) 


If, for example, the phase characteristic is restricted to two different cases, i.e. 
(a) even phase distortion and (b) odd phase distortion, then the “optimum” 
formulae become 
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(a) even phase distortion: 

t 0 — 0, 
` Po = arctan so : 
e , 
pO 


(b) odd phase distortion: 


[94A4$do 
P= 
° fw Ado 
Po = 0. 


In case (b) we find an “optimum” tọ, which is equal to the optimum for base- 
band systems, as has been found by Lucky +). For a base-band system A(w) 
is indeed even about w = 0 and ¢(w) is odd about w = 0. 

By a direct observation of the p(t) and q(t) functions we can roughly verify 
the above approximations: 

(a) for an even phase distortion and an even amplitude characteristic with 
respect to wa the p(t) and q(t) functions are even functions of time and 
thus the impulse response has its maximum at tọ = 0; 

(b) for an odd phase distortion and an even amplitude characteristic with 
respect to w,, the g(t) function vanishes and the resulting impulse response 
is p(t) cos yo, which of course is as large as possible for yo = 0. 


Example 


As a numerical example let us take the amplitude characteristic to have a 
racos-} shape. For the phase distortion let us take a quadratic shape for case (a) 
and a cubic shape for case (b). These characteristics are defined in fig. 21. 
Carrying out the necessary integrals we then obtain for the “optimum” values: 
(a) to — 0, 

yo = 0-29 radians; 
(b) to = 0:12 T, 

Yo = 0. 
In order to get more insight and in order to verify the above approximated 
values we calculated the eye openings for various values of yy. 

Figure 22a shows the results for the quadratic phase distortion and fig. 22b 
shows the results for the cubic phase distortion. As will be clear, the optimum 
eye opening is found by walking over the “mountain ridge" in the three- 
dimensional fig. 22. Figure 23a shows the projection of the ridge of fig. 22a 
on the eye,,,—yo-plane, while fig. 235 shows the eye opening for yg = 0 
from fig. 22b. Figure 23a shows further the projection of the ridge of the impulse 
responses and fig. 235 also shows the impulse response for yo = 0. The “op- 
timum" values, found by the approximation, are very close to the real optimum 
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Fig. 21. Base-band equivalent shape factor with a quadratic phase distortion characteristic 
and with a cubic phase distortion characteristic. 


values of the impulse responses, as can be verified from fig. 23. Figure 23 clearly 

shows the difference between the optima for the impulse responses and for the 

eye openings. 

In general, to find the optimum eye openings, it is easier to carry out numerical 
calculations, as has been done in fig. 22, than to start with a calculation of the 
approximate optimum values for tp and wo, followed by a search in their 
vicinities. For comparison purposes it is worthwhile to normalize the eye 
opening with respect to some other value. This means that the eye opening 
can then be expressed in %. There are several possibilities, such as 
(a) normalization with respect to the value of the impulse response A(t) at 

t = 0; this normalization is rather arbitrary, because neither the impulse 
response nor the eye opening has its maximum at £ = 0, except for ideal 
cases; 

(b) normalization with respect to the maximum value of the impulse response 
h(t); this normalization is frequently used, but for severe channel distortion 
the maxima of the impulse response and of the eye opening: occur at instants 
of time too far from each other; 

(c) normalization with respect to A(t) for the instant of time where the eye 
opening has its maximum; from (85) we then have 


E; (to) = h(tg) — y lits m): 100%, (135) 


= 
A(to,opt) maz-o 
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b) — tr 


Fig. 22. Worst-case eye openings versus time fọ and phase angle yo for the shape factor of 
fig. 21 and (a) quadratic phase distortion, (6) cubic phase distortion. 


for values of tọ where E; (tg) > 0. 
Figure 24 shows the eye opening of fig. 23b after normalization according to 
possibility (c). The normalization factor 1/h(to,op:) = 1:0187. Figure 24 also 
shows E> (fo) and A*(tj) as well as the normalized intersymbol interference 
I*(to). The actual difference between the instants of time where the eye opening 
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Fig. 23. (a) The maximum worst-case eye opening and the maximum height of the impulse 
response versus the phase angle wo as obtained from fig. 22a. (b) The maximum worst-case 
eye opening and the maximum height of the impulse response versus the time to as obtained 
from fig. 225. 


is maximum and where the intersymbol interference is minimum is completely 
negligible. 


5. Error-probability curves 


From the theory of worst-case eye openings it follows that for a system 
having some positive eye opening, no error will occur in the detection process; 
for a system with a negative eye opening (closed eye), there might be a good 
transmission with errors only if worst-case or near-worst-case data sequences 
are present. Taking into account noise contributions, the situation changes and 
errors may also occur for systems with positive eye openings. In order to judge 
the final quality of a digital transmission system in the presence of noise one 
often uses the so-called error-probability curve (which gives a plot of the 
probability of error versus the signal-to-noise ratio, where the noise is taken 
to be gaussian noise). In this section we briefly dicsuss some methods of arriving 
at these curves, without going into detail. 

Let us therefore take an additive gaussian-random-noise process n(t) of zero 
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Fig. 24. Optimally normalized worst-case eye opening: impulse response and intersymbol 
interference versus time fo. 


mean and variance o,? at the detector input. The probability-density function 


of n(t) is given by 
( d ) 136) 
exp | — ; 
(22)? c, p 20,7 ( 


The signal input at the detector is given in (81). For an ideal two-level trans- 
mission system, the eye opening is equal to +/A(0) or —A(O) (with equal prob- 
ability) at the sampling instants KT (see sec. 3.1.1). Introducing a normalization 
with respect to A(0) (the same factor as in (135)) we can state that an error occurs 
if the noise contribution at the sampling instants becomes less than —1 or larger 
than +1. Because the probability-density function of the noise is symmetric, 
we thus can write for the error probability 


P, = Prob (n(kT) > 1). (137) 
With (136) we then obtain 


PAY) um 


CENSUM E ( E Napa «( IY — ap 
e = — | —exp| —— = 4 erfc : 
Q5 jx i , m) 


where 1/o,? is directly related to the signal-to-noise ratio; 
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erfc (x) = (2/|/n) f e-'? dt is the complementary error function *). 


From the above formulae it follows that it is quite easy to arrive at the error 
probability for an ideal transmission system, where, at the instants kT, no 
intersymbol interference is present (see e.g. fig. 115). For non-ideal transmis- 
sion systems (see e.g. fig. 12) some intersymbol interference is present at every 
instant of time. As we shall see, this makes it more complicated to find the 
error probability. In the preceding section, the instant of time at which the 
worst-case eye opening is maximum, was called /5,,,,. The intersymbol inter- 
ference at this instant of time, normalized with respect to A(/,,,), will be 
denoted i.s.i. (45,,,,). The error probability can then be written as: 


P, = Prob (n(to,op + KT) + is.ic(to,op: + KT) > 1). (139) 


The density function of the intersymbol interference, if it exists, will be denoted 
by pisi(y). It is a symmetrical function with respect to y = 0 (as assumed 
already in (139)). The new density function of the noise plus the intersymbol 
interference is then found by convolving the two density functions: 


Pro(¥) = f. Pa — Ë) Pisi(S) d£. (140) 
-%0 
The error probability then becomes 
o 
Pe = e») dy. (141) 


1 
Substitution of (136) and (140) into (141) and remembering that the inter- 


symbol interference is bounded within the interval [—7*, J*] (see fig. 24), 
yields 


bcd 
P-i[ ee (— 7p) Pu dé. (142) 


Thus the final problem is to find the density function of the intersymbol inter- 
ference, which, in general, is hardly possible in view of the infinite summation 
of which the intersymbol interference is constituted. A good approximation, 


*) Some authors use the normal probability aici 


. Qo) = f exp (— t2/2) dt 


(27 ast] 


rather than the complementary error function. The error probability then becomes 
P, = Q(1/o,). 
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however, can be achieved by truncating the infinite summation, as has been 
done in our discussion of worst-case eye openings. Taking into account 4 bits 
preceding and following the sampling instant to,op, the intersymbol inter- 
ference becomes from (82) and (135) 


A 
Y a, Mto o — nT), (143) 


i... (fo, pi) = 
d h(to opt) n=—A 


where a, = + 1 with equal probability. With 2A bits a number of 274 different 
sequences can be constructed, each having a probability of occurrence equal 
to 1/224. For each sequence j the intersymbol interference, denoted by 
i.s.i.(J, to,opt), has a discrete value in the interval [—7*, I*]. This means that 
each sequence j contributes a Dirac delta function to the density function. 
Hence the probability-density function of the intersymbol interference can be 


written as 
224 


1 
Pia) — — LY 9Q — i5... 10,92). (144) 
Jal 


224 


Inserting this approximated discrete density function into (142) yields for the 


error probability 
224 


1 1 — i.si.(J, to,o 
Pes uu ) e (E e) (145) 
On 


jz1 


For long sequences the calculation of (145) becomes a formidable task. Hill ??) 
introduced a slightly different approach, in which it is possible to decompose 
the sequences into short sequences, leading to much simpler calculations. 
Instead of calculating the error probability from expression (145) many authors 
have tried to find upper bounds for the error probability. The most obvious 
bound is of course the worst-case bound, based on the worst-case eye opening. 
With the foregoing theory it is easy to see that this bound can be written as 


2 
n 


1 — I** : 
P, «ie ( ) ; (146) 


As this bound is a pessimistic one, Saltzberg ?*) derived an upper bound for 
the probability of error, based on the Chernoff bound. This upper bound is 
much closer to the real error probability, because not only the worst-case inter- 
symbol interference is incorporated, but also the mean-square value (variance) 
of the intersymbol interference. Lugannani ?*) extended this method. Glave 7°) 
found another upper bound, based on the Chebyshev inequality. Here the worst 
possible density function for the intersymbol interference is determined. It turns 
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out that this worst possible density function consists of only three Dirac delta 
functions, which in view of (145) makes the calculation very simple. Mat- 
thews 7°) extended Glave's method. A quite different approach to finding the 
error probability has been developed by Ho and Yeh 27:28) and by Shimbo 
and Celebiler ??). Their method is based on the series expansion of the prob- 
ability of error. The result is that the error probability is expressed in terms of 
the moments of the intersymbol interference and Hermite polynomials. Refine- 
ments of this method have been made by Yeh and Ho °°) and by Prahbu ?!), 

The calculation of the error-probability curves, as mentioned so far in this 
chapter, have been done for the optimum instant of time toopt- For static 
deviations from this instant of time the same methods apply. In practical 
systems, however, the sampling instant will be perturbed by jitter, the so-called 
timing jitter. For the calculation of the error probability this means that the 
jitter density function should be incorporated. This gives a deterioration of the 
error probability as has been shown for a simple example by Zegers $2). 

Another parameter that influences the probability of error is the demodula- 
tion angle w(t) (see sec. 2). It seems to be a formidable task to take into account 
this angle y(t), which might be perturbed by jitter too (statistically dependent 
on the timing jitter or not so). Recently Smith 3%) did some investigations on 
the eye openings of class-4 partial-response data signals after transmission with 
a carrier-phase error. 


6. Conclusions 


It is shown that for linear data-transmission systems, such as base-band, 
AMSC, VSB, SSB and QAM, the impulse response can be obtained from two 
basic functions, the p(t) and q(t) functions. Except for base-band systems, the 
phase angle between the carrier wave at the transmitter and the carrier wave 
at the receiver determines how the p(t) and g(t) functions should be combined 
to obtain the impulse response. The signal-space diagram can also be construct- 
ed with the p(t) and q(t) functions. The formulae of the impulse responses and 
signal-space diagrams are set down in tables I and II. From the impulse response 
the eye pattern for two-level systems has been defined, as well as the worst-case 
eye opening (or eye opening), this being the inner envelope of the eye pattern. 
The definition of the eye opening has been extended to multilevel systems, coded 
systems and orthogonal (QAM) systems with independent or dependent chan- 
nels. For numerical calculations the DFT has been introduced as a convenient 
tool to obtain the p(t) and q(t) functions and hence the impulse response. Due 
to the infinite summation in the definition of the eye opening, its actual calcula- 
tion can only be an approximation. From the discussion on the approximation 
error it follows that in general it suffices to take into account a number of 50 
to 100 bits to obtain good results. Only for systems with very steep roll-off 
factors it is necessary to include more interfering bits. 
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It is shown that the maximum of the eye opening does not coincide with the 
maximum of the impulse response, unless the system is ideal and fulfils the first 
Nyquist criterion. Approximations of the time instant and phase angle at which 
the maxima occur can be found. However, it is shown that it is easier to search 
for the optima by direct numerical computations, rather than to seek approx- 

‘imate values, followed by a numerical search in their vicinities. 

Concerning the normalization of the height of the eye opening it is stated 
that a normalization with respect to A(fo,opt) (where to,opt is the instant of time 
where the worst-case eye opening is maximum) is preferred in order to express 
the maximum as a percentage. 

Worst-case eye openings have proved to be very useful to judge data-trans- 
mission systems in the presence of amplitude and/or group-delay distortions in 
the transmission channel. In the presence of noise in the transmission channel, 
however, one often refers to the probability-of-error curve. 

A brief discussion is given on the available methods. 


Eindhoven, July 1975 


REFERENCES 


1) P. J. van Gerwen and P. van der Wurf, IEEE Trans. on Comm. Techn. COM-18, 
214-222, 1970. 
2) E. D. Sunde, Bell Sys. tech. J. 33, 721-788 and 987-1010, 1954. 

3) R. W. Lucky, J. Salz and E.J. Weldon, Principles of data communication, McGraw 
Hill, 1968. 

4) H. Nyquist, AIEE Trans. 47, 617-644, 1928. 

5) R. A. Gibby and J. W. Smith, Bell Sys. tech. J. 44, 1487-1510, 1965. 

$) O. B. P. Rikkert de Koe and P. van der Wurf, Proc. IEEE 57, 701-702, 1969. 

7) J. R. Davey, Bell Sys. tech. J. 43, 2973-2983, 1964. 

8) A. Kok, Synthesis and analysis of synchronous data signals realised by means of trans- 
versal filters, Het PTT-Bedrijf 17, No. 1, Aug. 1970. 

9) H. C. van den Elzen and P. van der Wurf, IEEE Trans. on Comm. COM-20, 139-147, 
1972. 

10) A, Lender, IEEE Trans. on Comm. and Electr. 66, 214-218, 1963. 

11) P. J. van Gerwen, Philips Res. Repts 20, 469-484, 1965. 

12) E, R. Kretzmer, IEEE Trans. on Comm. Techn. COM-14, 67-68, 1966. 

13) K. H. Schmidt, Electrical Communication 48, 121-133, 1973. f 

14) A. Lender, Nonbinary correlative techniques for high speed data transmission, NEREM 
Record, Nov. 1968, Boston. 

15) R. J. Sluijter and P. J. van Gerwen, Electronics Letters 7, 640-642, 1971. 

16) J. W. Cooley and J. W. Tukey, Math. Comput. 19, 297-301, 1965. 

17) W. T. Cochran, J. W. Cooley, D. L. Favin, H.D. Helms, R.A. Kaenel, W. W. 
Lang, G.C. Maling, Jr., D. E. Nelson, C. M. Rader and P. D. Welch, IEEE 
Trans. on Audio and Electroacoustics AU-15, 45-55, 1967. 

18) J, W. Cooley, P. A. W. Lewis and P. D. Welch, IEEE Trans. on Audio and Electro- 
acoustics AU-15, 79-84, 1967. 

19) G. D. Bergland, IEEE Spectrum 6, July, 41-52, 1969. 

20) G. C. Groenendaal and L. Kristiansson, Private communication. 

21) R. W. Lucky, Bell Sys. tech. J. 42, 2427-2483, 1963. 

22) F. S. Hill, Bell Sys. tech. J. 50, 2055-2077, 1971. 

23) B. R. Saltzberg, IEEE Trans. on Information Theory IT-14, 563-568, 1968. 

24) R. Lugannani, IEEE Trans. on Information Theory IT-15, 682-687, 1969. 


WORST-CASE EYE OPENINGS IN DATA-TRANSMISSION SYSTEMS 435 


25) F. E, Glave, IEEE Trans. on Information Theory IT-18, 356-363, 1972. 

26) J, W. Matthews, IEEE Trans. on Information Theory IT-19, 440-447, 1973. 

27) E, Y. Ho and Y. S. Yeh, Bell Sys. tech. J. 49, 2249-2265, 1970. 

28) E. Y. Ho and Y. S. Yeh, Bell Sys. tech. J. 50, 1017-1023, 1971. 

29) i Shimbo and M. I. Celebiler, IEEE Trans. on Comm. Techn. COM-19, 113-119, 
71. 

30) y. S. Yeh and E. Y. Ho, Bell Sys. tech. J. 50, 2585-2598, 1971. 

31) v. K. Prahbu, Bell Sys. tech. J. 50, 3127-3151, 1971. 

32) L. E. Zegers, Philips Res. Repts Suppl. 1972, No. 4. 5 

33) B. M. Smith, IEEE Trans. on Comm. Techn. COM.-22, 696-698, 1974. 


R 934 Philips Res. Repts 30, 436-482, 1975 


RESISTIVE-INSULATED-GATE ARRAYS 
AND THEIR APPLICATIONS: 
AN EXPLORATORY STUDY 


by M. V. WHELAN, L. A. DAVERVELD and J. G. de GROOT 


Abstract 


A resistive-insulated-gate electrode with a voltage gradient applied along 
it has many uses. In this paper we present an exploratory investigation 
of a number of novel circuits in which the foregoing forms the backbone. 
We concentrate on three areas of application, namely steering circuits 
for electronic displays, analog-digital conversion and light-pattern 
scanning. Although we concentrate mainly on principles of operation, 
we also derive limits for salient characteristics, e.g. scanning speed, reso- 
lution. We do this with the aid of expressions based on certain simple 
models which have been verified by experiment. 


1. Introduction 


A resistive-insulated-gate electrode with a voltage gradient applied across it 
forms the basis of a number of novel circuits. These are simple to operate and 
can be made using standard silicon-gate technology. (Polycrystalline silicon has 
proved to be admirably suitable for the resistive gate.) One type of circuit for 
instance allows an analog voltage to regulate the length of a light column com- 
posed of a number (e.g. 10) of light-emitting diodes. Such a combination 
can be used as say volume, tone, balance indicators in radio and T.V. equip- 
ment. Such indicators are relatively cheap, reliable, robust, simple, and even 
necessary in remotely controlled equipment. 

Extensions of the foregoing simple type of array yield more-sophisticated 
types of analog-digital convertors which with present approaches already offer 
7-bit convertors with conversion times of 1 us and less. 

Resistive-insulated-gate arrays also yield light-pattern scanners. In particular 
line-type scanners of up to 100 elements have been realized. 

Because of the voltage gradient applied along the resistive insulated gate an 
inversion layer can be made to grow and contract under it by varying a voltage 
applied between the gate and the substrate. This can for instance be used to 
turn on and off IGFETs which share the gate electrode. A more fundamental 
use is considered in sec. 3.1. 

This paper is an exploratory investigation of three areas of application of 
resistive-insulated-gate arrays, namely, steering circuits for electronic displays, 
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analog-digital conversion, light-pattern scanning. Although we concentrate 
mainly on principles of operation of various resistive-gate configurations, we 
also derive limits of salient characteristics (scanning speed, resolution). We do 
this with the aid of expressions based on certain simple models which have been 
verified by experiment. We wish to emphasize that the dunt have not been 
optimised as regards packing density and layout. 


2. Analog-voltage switches and simple analog-digital convertors 


; 


2.1. Light-bar-indicator control circuit 


One very simple but useful application of a resistive gate is to allow an analog 
signal to regulate the length of a bar-type indicator composed of lamps, e.g. 
LEDs. A plan view of such an arrangement is shown in fig. 1a. It comprises a 
row of IGFETs with a common polysilicon resistive-gate electrode. (Depending 
on the application, the sheet resistivity can be chosen to lie between 15 and 
200 O/L] or larger.) A voltage V; is applied across the gate. The analog volt- 
age V, which is applied between one end of the gate and the substrate regulates 
the number of outputs (hence the number of lamps illuminated) which are con- 
nected by an inversion layer to the common input which is grounded (connected 
to the p substrate). The operation of the array can be understood by referring 
to fig. 1b. This shows the variation of the voltage (vertical axis) across the 


Inversion under gate 


Voltage 


V=0 EThreshold voltage 


——»- Distance along gate 


b) 


Fig. 1. (a) Plan view of a simple resistive-gate array for control of an indicator composed 
of a row of LEDs. 
(6) Voltage along the resistive gate for one value of V,. 
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resistive gate for one value of V,. Inversion exists under the gate where the 
voltage exceeds the threshold value Vr. Figure 3 shows a photo of an actual 
array similar in layout and design to the one shown schematically in fig. 1a: 
This device was suitable only for relatively small currents (W// = 2:5). It was 
originally designed for liquid-crystal displays. For devices of this type the spread 
in Vr from IGFET to IGFET and from chip to chip on the one slice was 
+ 30 mV. The spread in the sheet resistivity of the polysilicon across a two- 
inch slice was less than 6 %. 

Figure 2 shows the principle of a layout used to obtain arrays suitable for 
relatively large currents (mA) which are needed for displays using LEDs. The 
meander structure yields in a small area a large width-to-length ratio for the 
IGFETs. In the arrangement of fig. 1a each n* region is gradually connected 
to the common n* region (shown earthed) as the inversion layer is made to 
advance under the gate by V,. This gradual turning-on is caused by the voltage 
dropped along the resistive gate across the width (parallel to the long axis of 
the gate) of each n* region. In practice we would prefer to avoid this gradual 
turning on and off. This effect becomes even more acute when we desire a large 
width-to-length ratio for each IGFET. The arrangement of fig. 2 avoids the 
foregoing problem, since the voltage drop per active region of each IGFET 
now only occurs across the relatively small distance between the source and 
drain. 

Figure 4 is a photograph of a chip made using the foregoing layout principle. 
It shows a row of 12 IGFETs. Each has a width-to-length ratio of 1000 (gate 
width is 5-5 mm). The chip is 2x2 mm?. The array is suitable for control of a 
display composed of LEDs. The gate oxide is 1000 À thick: the transconduct- 
ance is 15 mA/V: threshold voltage 0:3 V. The resistance between the ends of the 
polysilicon gate used to distribute V; between the 12 IGFETs is 10 kQ (sheet 
resistivity of the poly 20 Q/D). At the expense of an extra mask this resistance 
can be increased to a much larger value, independently of the meander gates 
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Fig. 2. Part of a layout for a resistive-gate bar-indicator control circuit suitable for use with 
LEDs. i : : : à 
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Fig. 3. Photograph of a 
simple-resistive-gate bar- 
indicator control circuit 
suitable for small-current 
applications — (width-to- 
length ratio of each 
IGFET is 2:5). 
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Fig. 4. Photograph of a chip which is suitable for control of a bar indicator composed of 
12 LEDs. The layout is similar to that of fig. 2. The width-to-length ratio for each of the 
12 n-channel IGFETs is 1000; resistance between the gate ends is 10 KO (onec, ~ 20 9/0). 
Chip size is 2: 2 mm?. Source-to-drain distance is 5:5 uum. 


of the individual IGFETs. The relatively long length of the meander gates of 
the individual IGFETs, even for a sheet resistance of 20 O/[ ], places a speed 
limitation on the circuit. However, for most applications, e.g. indicators in 
radio and video equipment, this speed limit exceeds that required. A quanti- 
tative estimation of the upper frequency limit for the meander gate can be 
made with the aid of the expression in eq. (2.1). 

The spread in threshold voltage for the row of 12 IGFETs across a two-inch 
slice was less than + 15 mV and the variation of the resistance (10 kQ) be- 
tween the ends of the resistive gate was less than 6%. Satisfactory operation 
of a display of 12 20-mA LEDs is obtained with a voltage across the resistive 
gate of nine volts. If 5-mA LEDs are used three volts gives satisfactory results. 
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2.2. Pointer circuit 


For certain applications, e.g. station indication in radio equipment etc., a - 
circuit is required which causes only one of a number of lamps of a row to 
illuminate. The lamp chosen depends on the value of an analog signal applied 
to the selection or pointer circuit. We now consider such a circuit. This can be 
obtained by a twin-gate arrangement *) as illustrated schematically in fig. 5a. 
The operation of the arrangement of fig. 5a can be understood by referring to 


Overlop 


E: Threshold 
voltage 


——— Distance along gate 


b) 


Fig. 5. (a) Layout of a twin-resistive-gate selection system. It allows e.g. only one lamp to 
illuminate for each value of V;. 
(b) Voltage variation along the resistive gates of the foregoing layout. 


fig. 5b. This shows the voltage across both resistive gates for one particular 
value of input voltage. The voltage V, regulates the overlap of the inversion 
layers under the resistive gates. For the case shown only one lamp at a time 
is allowed to illuminate. When the analog voltage changes, so does the position 
of the overlap and hence the làmp which illuminates. À 

In principle, for the case shown in fig. 5, channel stoppers should be used 
between the IGFETs corresponding to gate C. Otherwise for instance, current 


*) While studying the above arrangement we came across a proposal, by Kabell!)}, of a 
twin-gate arrangement in a light-pattern scanner. 
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could also flow in IGFETs d, e, etc. of gate C via the inversion layer under 
gate C, the source of IGFET c, andvia IGFET c of gate D. In practice, however, ` 
the oxide between the IGFETs of a row is much thicker than that between the 
source and drain regions of each IGFET. For instance for the example in fig. 5a, 
although IGFET c is conducting, no inversion will exist under gate C between 
IGFETs c and d. This fact eliminates the need for diffused channel stoppers 
between the IGFETs, which simplifies the process and results in higher packing 
densities. l 

A photograph of an actual twin-gate chip is shown in fig. 6. This device was 
intended for low-current applications. The layout is similar to that of the single- 
gate array of fig. 3 and similar results as regards spread in threshold voltage 
and gate resistance were obtained for this array. 

One interesting aspect of the twin-resistive-gate arrangement is selectivity and 
its dependence on V, (see fig. 5). Some preliminary results were obtained as 
follows. The biases were adjusted so that the overlap between the inversion 
layers was centred at say position c, fig. 5a. A bias was next applied to ter- 
minals b, c and d, and the current to the earthed common source was measured. 
We define the selectivity to be high when the ratio of the current at terminal c 
to that at either of the adjacent terminals is large. This ratio was measured for 
different values of V4. For each value of V3 the input was adjusted to centre 
the overlap region at position c. The measured results are shown in table I. 
As can be seen the ratio of the currents can be as large as 4 . 10? ((2:5 . 107471). 
Also noticeable is that there is an optimum setting of V4. As one would expect, 
a large value of V, yields a large overlap and relatively large and practically 
equal currents can flow through the three terminals. On the other hand, when 
no overlap occurs, no current can flow. Hence for some intermediate value of 
overlap the selectivity will be a maximum. Figure 7 shows a measured plot, for 
a twin-gate arrangement, of the variation of current flow through the terminals b, 
c, d, versus the same voltage applied to both gates and in the absence of a 


TABLE I 


Measurements for a twin-resistive-insulated-gate system illustrated in fig. 5a. 
The influence of V, is shown. For each value of V3 the voltage V, was ad- 
justed to centre the overlap near region c so that 1, e I, < J, 


43.107? 
e 07.107: 
0-7 . 107? 
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Fig. 6. Photograph of 
a twin-resistive-insulated- 
gate selection chip, suit- 
able for small-current 
(uA) applications (each 
IGFET has a width-to- 
length ratio of 2-5). 
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Fig. 7. Current versus gate voltage for the common n* region and each of three outputs 
for a double-resistive-gate system. The connections used are illustrated in the inset. The 
different symbols are for the three different outputs. 


voltage gradient along either gate. These plots show that the effective drain-to- 
source current for each double IGFET (two gates, IGFETs in series) varies 
exponentially *) with gate voltage before the onset of the well-known quadratic 
region. We have found that for a given value of V, (i.e. voltage across the 
resistive gates) that maximum selectivity combined with maximum current- 
handling capacity corresponds to the double IGFETs at the overlap region 
(e.g. b, c and d in fig. 5a) being operated in the exponential part of their J/V 
curves (fig. 7) just below the onset of the quadratic region. 

The device just described and the results presented are preliminary ones: the 
object was to investigate the feasibility of the twin-gate arrangement for selec- 
tion. We show in the next section that this technique can be used to select from 
as many as thirty-two levels. Some solutions to drawbacks such as the necessity 
of three supplies (fig. 5a) are being studied and will be considered in a later work. 


2.3. Selection- RO M-type analog-digital convertor 
2.3.1. Principle of operation 


The layout of an analog-digital convertor is shown in fig. 8. À twin-resistive- 


*) Such an exponential behaviour has been considered by other workers ?:9). 
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V Q^] Double-gate Selection system 
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Fig. 8. Schematic double-gated five-bit analog-digital convertor. 


insulated-gate arrangement selects in a read-only memory an address line whose 
position depends on the value of an analog input to the twin gates. Depending 
on the address line chosen a certain combination of ones and zeros will appear 
at the output lines of the ROM. As illustrated in fig. 8 each address line cor- 
responds to an insulated gate which is shared by a number of IGFETs. The 
output lines of the ROM are perpendicular to these common, gate or address 
lines. If a one is desired at an output line, this line is allowed to contact the 
drain of an IGFET. The sources of all IGFETs whose drains are contacted are 
connected to earth as shown in fig. 8. Also shown is a so-called sampling gate; 
this is an insulated gate (not resistive) which is common to a row of IGFETs. 
These have a common source which is shorted to earth. The drains are separate. 
When the sampling gate is turned on, the address lines are connected to the 
common source of the sampling gate and are hence shorted to earth. This 
prevents any address line being turned on by the twin-gate arrangement so that 
only when desired, i.e. when the sampling gate is turned off, will an output be 
obtained from the ROM. 

The overlap of the inversion layers in the twin-gate arrangement is adjusted 
so that preferentially one, but never more than two lines of the ROM can be 
addressed at any instant. Unless this is done no output will be obtained from 
the ROM for those analog input voltages which correspond to the overlap 
located between two address lines. This is not allowable since it implies zero 
digital output for a finite analog input. On the other hand, although we can 
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Fig. 9; Photograph of a double-gated five-bit analog-digital convertor. 
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avoid this problem by proper adjustment of the overlap, another one arises, 
namely, for those analog voltages which correspond to two lines being ad- 
dressed. If a binary code is contained in the ROM, the digital output in the 
foregoing case, ie. two lines addressed, will not correspond to the analog 
voltage. We greatly reduce such an error by using instead of a binary, a Gray 
code in the ROM. 

Figure 9 shows a photograph ofa chip (3 x 1-5 mm?). This device was designed 
primarily to check the principle of operation of the convertor. In future designs 
we can realize the same function with improved conversion speed and selectivity 
in a chip smaller than 2-5x 1 mm?. The device of fig. 9 is a five-bit A/D con- 
vertor based on the principle outlined in the last paragraph. To the left are the 
twin gates with 32 levels; the resistance between the ends of each gate is 6 KQ. 
A voltage of 12 to 15 volts across each gate enabled each of the 32 levels to be 
selected. Smaller voltages are possible for thinner gate oxides and smaller source 
to drain distances. We used respectively 1100 A and 8 ym. The centre part of 
the photograph shows the ROM; the width-to-length ratio of these IGFETs 
is 5. The sampling-gate arrangement is to the right; the width-to-length ratio 
for each IGFET in this array was 10. Í 

Some preliminary d.c. measurements which indicate the feasibility of the twin- 
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Fig. 10. Contents of the ROM of the five-bit A/D convertor of fig. 9. Also shown is the cur- 
rent which was measured for each output line (a^, b’, c’, d’, e^, f^ of the ROM when the 
memory was scanned by means of the double-gate system. 
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gate selection system are shown in fig. 10. Shown as a function of the total 
analog input voltage is the variation of the current which flows into each of the 
output lines of the ROM in turn, as the twin-gate system scans the memory. 
The voltage drop across each of the resistive gates was 15 volts. The major 
peaks on the output memory line characteristic correspond to groups of ad- 
jacent IGFETs in the memory. The minor humps superimposed on these major 
ones are due to individual IGFETs in the memory. The results show that there 
is sufficient difference indeed between ones and zeros to be able to detect the 
contents of the memory. The performance as will be shown in later work can 
be greatly enhanced by the use of amplifiers between the resistive-gate selection 
circuit and the address lines of the memory and by the use of pulse-shaping 
circuits at the output lines of the ROM. 


2.3.2. Conversion speed 


We will now consider two possible limits to the speed of operation of the 
foregoing type of A/D convertor. 

One speed limitation is caused by the resistive gates and the associated 
capacitance due to the underlying oxide. These form a distributed RC network 
with its associated frequency limitations. As we will show in the following sec- 
tion an upper frequency limit can be defined which is 


LONE 21) 
IP os Cox. l 


L is the length of each of the resistive gates, o, is the sheet resistivity of the poly- 
silicon, and C,, is the capacitance per unit area of the oxide under the gate. For 
the device shown in fig.9, o, —21 Q/O, L —2:57 mm, Cox = 3:15. 107° F/em?, 
which upon insertion into eq. (2.1) yields f = 1:2. 10° Hz. 

We will now consider the other speed-limiting factor, namely that associated 
with the charging of a ROM line to its limiting voltage. The situation we con- 
sider is illustrated in fig. 11. It shows the sampling gate turned off, i.e. it is 
grounded. The problem is to estimate how quickly the line Grom can be charged 
via the overlapping inversion layers to some limiting potential. The problem 
can be reduced to that represented by the simple equivalent circuit in fig. 11. 
The IGFET M, of effective channel length 2Z represents the overlap of the 
inversion layers and the resistive gates at that point; M, represents an IGFET 
from the sampling array, it is not conducting. The capacitance of the ROM 
address line Grom with respect to earth is represented by Crom- For simplicity 
we assume this to be independent of voltage and to be determined by the oxide 
thickness under Grom- 

In appendix B we derive with the aid of standard IGFET theory the fol- 
lowing relationship: 
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a) i ROM address lines 


b) 


Fig. 11. Single line only being selected in the ROM of A/D convertor of the type shown in 
fig. 8. The sampling gate is shorted to earth, and the equivalent circuit for the charging of 
the selected ROM line is also shown. 


B t. a (& Vem + Vo)* — Vot 


^ 


4 Crom y? —4À (a+ V+) [a + (æ Vom + Vo)*] 


(2.2) 


The time ¢, is that needed to charge the capacitance Crom to some fraction « 
(1 > « > 0) of its final voltage V,m. The other quantities are defined in ap- 
pendix B. 

Let us now consider a numerical example and use figures typical of the 
device shown in fig. 9. The parameters are Vro = 0-3 V: N, = 1015 cm^?; 
Crom = 8:95 . 107? F (length of ROM address line over thin oxide is 600 um, 
line width is 10 um, oxide thickness 0-12 um, dielectric constant of the oxide 
3:8 . 10-1? F/cm); 8 = 1-26.10-* (u œ 800 cm? V-! s-1, oxide thickness 
0-12 um, effective ratio of gate width to channel length for the twin resistive 
gate — 5). For V, — 1 (the effective gate voltage at the point of overlap of the 
inversion layers) we obtain, using eq. (B.2), Vem = 0:6 V. Hence the IGFETs 
associated with ROM address lines can be charged to 0-3 V above their thresh- 
old value of 0-3 V (the sources of these IGFETs are grounded). With the aid 
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of the foregoing parameters and eq. (2.2) we obtain 
t, = 0:14 . 1076 s. (2.3) 


In other words, it takes 0-14 . 1076 seconds to charge the ROM address line 
to 90% (x = 0-9) of its final value of 0-6 V. 

There still remains the problem of estimating the voltage needed across the 
resistive gates to correspond to the foregoing results. We now estimate this. 
We require that the adjacent ROM line on either side is not charged to a 
voltage which exceeds 0:3 V, otherwise the IGFETS associated with these lines 
will be conducting. If we take 0-2 V as limit to which the adjacent lines may be 
charged then eq. (B.2) yields (Vi, now is 0-2 V) an effective gate voltage of 
0-57 V. Hence we need a voltage drop of 1 — 0-57 = 0-43 V along the resis- 
tive gates between the IGFET pair at the point of overlap of the inversion 
layers, and each of the adjacent IGFET pair on either side. This corresponds 
to a total voltage across each resistive gate of 13-7 V. 

On the basis of the calculations of this section at least for the device pre- 
sented in fig. 9 the limiting factor in the speed will not be the resistive gate 
with its associated oxide capacitance, but rather the speed with which a ROM 
address line can be charged to a sufficiently large voltage to turn on the IGFETs 
common to this line. The discharge speed of the ROM line is not considered 
as a limiting factor since the IGFETs in the sampling array, and responsible 
for the discharge, can be heavily turned on. On the other hand the IGFETs 
responsible for the charging of the ROM line are subject to limitations coupled 
with the maximum allowable voltages which can be applied across the resistive 
gates. 


2.3.3. Influence of leakage currents 


In some samples of the type shown in fig. 9, we failed to obtain an output 
when certain lines of the ROM were addressed. We eliminated by probe testing 
the possibility of breaks in the connecting paths between these lines and the 
outputs of the double-gate system. Eventually we discovered that the problem 
was due to leakage currents of diodes connected to the ends of the address 
lines. As an address line of a ROM is charged up via overlapping inversion 
layers in the twin-resistive-gate system, the leakage current of the diodes (source 
to bulk diode of a resistive gate IGFET, and drain to bulk diode of a sampling 
gate IGFET) connected to the address line increases, while the current which 
can be delivered by the double-gate system decreases. Equilibrium is reached 
when the charging and discharging currents become equal. The final value of 
the voltage to which the address line can be charged will thus be limited by the 
leakage current of the diodes, and this voltage can be less than the threshold 
voltage of the IGFETs of the ROM associated with the address line. 
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2.3.4. Some concluding remarks 


Although the foregoing twin-gate system has the advantage of simplicity of 
construction and flexibility to the extent that the overlap between the inversion 
layers under the gates and hence the selectivity can be adjusted simply by ad- 
justing an external voltage, the system has some drawbacks. Firstly, two resis- 
tive gates are needed together with their associated driving voltages some of 
which are floating. Secondly, the need for an extra voltage to adjust the selec- 
tivity, particularly when it is desired only to select one line of the ROM ata time, 
is undesirable. Thirdly, the definiteness with which an address line of the ROM 
can be selected and the speed with which it can be charged to a desired voltage 
increase with the magnitude of the voltages applied across the resistive gates. 
These voltages can be relatively large, and since the variation of the input signal 
needed to scan the whole of the ROM is directly related to these voltages, this 
means an increase of selectivity, and speed necessitates a corresponding increase 
in input signal. A fourth drawback is that in order to drive the twin-resistive- 
gate system the input signal has to be split, one half being applied to one gate, 
the other half to the other gate. In later work we will dwell more deeply on these 
problems and present solutions to them. 


2.4. Pulse-train serial-type analog-digital convertor 


2.4.1. Introduction 


Another type of analog-digital (A/D) convertor is illustrated schematically 
in fig. 12. It consists of a single-resistive-gated array and a binary counter. 
Briefly the arrangement works as follows: a voltage is applied across the resis- 
tive gate, a triangular wave between the gate and the substrate. During each 
period of the latter, for reasons we will explain later, a chain of pulses will 
appear at the output of the resistive-gate array across the resistance R. The 
number can be regulated by an analog signal in series with the triangular wave. 
Hence, if the pulse chain is fed to the counter the output of the latter yields a 
binary equivalent of the analog signal. The counter is automatically reset to 
zero after each cycle of the triangular wave. The analog signal varies at a rate 
which is small compared to that of the change of the triangular wave, i.e. during 
each period of the triangular wave one analog sample is obtained. Using the 
foregoing arrangement and a resistive-gate array which yielded a maximum of 
32 pulses, a 5-bit A/D convertor was successfully bread-boarded *). The func- 
tion of the compensation gate in fig. 12 will be considered later; briefly, it is 
used to suppress an undesirable capacitive signal contained in the output pulse 
train. 


*) We are grateful to R. P. R. Scaife who performed this work during a summer recess 
from Trinity College Dublin. 
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Fig. 12. Operational diagram of an A/D convertor using two resistive-gated arrays. One 
is to compensate an undesirable capacitive current in the active array. The p* regions 
(100 x 100 um? separated. by 30 um from each other and 20 zm from the common p* region) 


shown dashed lie under the resistive gate but are insulated from it by an oxide layer.The 
inset indicates qualitatively an output pulse train. The number of pulses is determined by 
the analog signal. The counter is reset by the positive peak of the triangular wave. 


Since the basic unit in the foregoing A/D convertor is the resistive-gate array, 
in the remainder of this section we confine our attention solely to this element. 
We explain its mode of operation, present a simple model to explain the shape 
of the output pulses. We then consider some limitations and improvements. 
Finally we derive the relationship between for instance maximum practical 
speed of operation and parameters such as device dimensions and voltage gra- 
dient across the resistive gate. 

Many of the considerations of this section also apply to the light-pattern- 
scanning units considered in sec. 3. 


2.4.2. Qualitative explanation of the operation of the resistive- 
gated array 


We will now qualitatively explain the operation of the resistive-gated array 
illustrated in fig. 13. It comprises a row of p* regions which are covered by a 
resistive-gate electrode (polysilicon), but are insulated from it by a relatively 
thin oxide layer (1000 À). A single p* read-out region is parallel to the row, 
and one of its edges just extends under the gate, but is insulated from it by an 
oxide layer. A voltage gradient (~ 30 V/cm) is applied across the gate by V2. 
A triangular wave is applied between the gate and the n substrate. For the 
moment let us choose this so that during its period the whole of the gate becomes 
positive with respect to the bulk and subsequently sufficiently negative to invert 
the n silicon under the whole of the gate. The voltage distribution across the 
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Fig. 13. Resistive-gated array which yields an output pulse train across R when driven bya 
triangular voltage wave V. Also shown for different values of V, is the voltage distribution 
across the resistive gate. 


gate at various times during a period of the triangular wave is indicated in 
fig. 13. As the gate becomes positive with respect to the underlying silicon (this 
occurs during the part of the triangular period between 0 and [2], see fig. 13) 
then holes will be driven out of the p* regions under the gate and into the sur- 
rounding n substrate where they recombine with electrons. 

In more detail what happens is the following. Part of the positive voltage 
between the gate and the substrate appears across the p*-—n junctions and for- 
ward-biases them. This allows holes to be injected into the n substrate (we 
neglect the small amount of electrons injected from the n substrate into the 
p* region). A negative charge of exposed acceptor atoms will be left at the 
oxide-silicon interface. This compensates the positive charge at the gate side 
of the oxide. The flow of holes from the p* regions will continue as long as 
the gate voltage continues to increase, the magnitude of the current is propor- 
tional to the rate of change of the gate voltage. When the gate voltage ceases 
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to change, the hole currents will drop to zero and the whole gate voltage will 
appear across the oxide and a surface space-charge layer at the silicon-oxide 
interface. In the case of the p* regions this latter component of the voltage is 
negligible. Summarizing, because the pt-n junctions become forward-biased 
when the gate voltage is increasing, the oxide capacitance between the gate and 
each p* region will be charged to more or less the full voltage between the gate 
over each p* region and the n substrate (of course the oxide capacitance be- 
tween the gate and the n substrate will also be charged). 

When the triangular wave begins to decrease ([2] to [4], fig. 13), in the case 
of the gate over the n bulk there is no problem. In the case of the p* regions 
the situation is different. Reduction of the gate voltage also requires a net flow 
of positive charge to the p* regions. This can be achieved either by electrons 
leaving the p* regions or by holes flowing into them from the surrounding 
n material. Since both carriers are minority ones in their respective regions the 
demand cannot be met, in fact the ptn diodes are in their blocking condition. 
The situation for each p* region can now be represented by two capacitances 
in series with the voltage between the gate and the n substrate being applied 
across them. One capacitance is formed by the oxide, the gate and the p* region, 
the other capacitance is due to the junction between the p* region and the 
n bulk. This latter one is voltage-dependent. This second capacitance is intro- 
duced into the circuit at the moment the gate-to-substrate voltage begins to 
decrease from its maximum value: at the same moment the oxide capacitance 
has been charged to this voltage and the other capacitance has no voltage 
applied across it. As the gate voltage reduces from its maximum value, a voltage 
will develop across the p*n -junction capacitance. This is such as to reverse-bias 
the junction (we neglect the small rate of holes supplied to a p* region due to 
leakage currents caused by this reverse bias). The sum of the voltages across 
the two capacitors equals that applied between the gate (covering the p* region 
being considered) and the n substrate at any moment. The net amount of 
charge on the plate (p* region) shared by both capacitors remains equal to that 
induced when the maximum positive voltage was developed across the oxide 
capacitor, just before the gate voltage began to decrease and before the p*n- 
junction capacitance was introduced in series with this oxide capacitance. As 
the gate voltage continues to decrease, eventually the situation indicated by 
condition [3] in fig. 13 will be reached. By referring to the lower part of fig. 13, 
which shows the voltage along the gate for various values of triangular voltage 
it can be seen that inversion is about to occur under the gate at end A. When 
inversion occurs, holes will be able to flow from the read-out region (grounded 
via R). The effect of connecting a p* region by an inversion layer to the read- 
out region is, to short-circuit the p*-junction capacitance by means of R, and 
to allow the oxide capacitance (between the p* region and the gate) to be 
charged to the voltage which exists between the gate and the substrate where 
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the p* region is discharged. The flow of holes via the inversion layer as we 
explain in more detail later, is governed by IGFET characteristics. The flow 
of holes to a p* region when it is connected by an inversion layer to the read- 
out region causes a signal to appear across the resistance R. As the triangular 
voltage continues to decrease, other p* regions will be sequentially connected 
by a growing (grows from end A for the case shown in fig. 13) inversion layer 
to the read-out region. A chain of pulses will thus appear as a function of time 
across the resistance R. 

The amount of charge which flows to each p* region to discharge it, depends 
on how positive the gate over it becomes during the cycle of the triangular 
wave, and on the magnitude of the threshold voltage. Referring to fig. 13, the 
charge flow to discharge a p* region at position x is proportional to the voltage 
difference ac. It can be seen that a different amount of charge will be needed 
to discharge each p* region. Those near end B demand the most, those at end A 
the least. Later we indicate methods to obtain equal amounts of charge flow 
to each region in a row. 


2.4.3. Discharge-pulse theory and comparison with experiment 


A simple IGFET theory has earlier been shown 5) to be adequate to describe 
the behaviour of the foregoing type of arrays. Before outlining this theory and 
presenting some experimental results the background to this theory will be 
presented. 

Figure 14a shows part of a resistive-gate array similar to the one of fig. 13. 
For simplicity we assume that inversion is now growing under the gate from 
left to right. We are interested in current flow to the arbitrary p* region h via 
the advancing inversion layer. This region h has been charged in the manner 
described in the previous section. We assume that region g is completely dis- 
charged before the discharge of h begins. We further assume that for this reason 
the current responsible for the discharge of h will flow via region g across to h 
as shown indicated. We assume the current flow directly between h and the 
common read-out p* region to be negligible simply because the channel or 
inversion layer will be present between g and h before it occurs between h and 
the common read-out region. The current which flows to h (as the gate voltage 
continues to change because the triangular voltage in fig. 13 changes) will cor- 
respond to that of the IGFET which consists of g as grounded source region, h 
(before the discharge begins) as reverse-biased floating drain: for gate voltage V, 
we use an average value for the potential of the gate between g and h with 
respect to the substrate; because of the voltage gradient along the gate due 
to V, (see fig. 13), the voltage along the gate is nowhere alike at any moment. 
Characteristics for the foregoing IGFET are sketched in fig. 14b. The vertical 
axis is for current which can flow between the reverse-biased floating region h 
and the already discharged region g. The horizontal axis represents the reverse 
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Fig. 14. (a) Part of a resistive-gated array of the type shown in fig. 13; also shown here is a 
qualitative sketch of the discharge peak for one p* region of the array. 

(b) Qualitative indication of IGFET characteristics of the discharge path for a p* region 
for three different sets of conditions. 


bias on region h. The quantity V, is the average gate voltage over the region 
between g and h, this is a function of time. With the aid of these characteristics 
we will now qualitatively indicate the influence of various factors on the dis- 
charge of region h. The point p indicates the beginning of the discharge of h. 
The potential of h with respect to the substrate at that moment is o-p. As 
current begins to flow to h, because the gate voltage is changing with time, 
the reverse bias on h will decrease. As the gate voltage changes the discharge 
current will increase, and the reverse bias on h will drop even more rapidly. 
The total discharge path will look something like pao, from p to a the 
IGFET operates in its saturation region, from a to o in its linear region. The 
path pa'o for instance indicates the effect of increasing the rate of change of 
the gate voltage V, (i.e. increasing the frequency of the triangular wave V, in 
fig. 13). The curve p'a"o indicates the influence of increasing the positive swing 
of the triangular wave (the larger the positive swing the more holes which are 
pumped out of each p* region, and the larger the reverse bias on the p* regions 
just before discharge begins). 

The time dependence in the foregoing cases is introduced by the gate volt- 
age V, whose rate of change is determined by that of the triangular wave V, 
(fig. 13, [3] to [4]). 

Not evident from the foregoing is the influence on the discharge characteristic 
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of the size of the p* regions. For the case shown in fig. 14a, increasing the 
dimension of the p* region, in a direction parallel to the current flow responsible 
for the discharge of h, causes an increase of the oxide and junction capacitances 
associated with region h. This means more holes have to flow to region h to 
discharge it and charge the oxide capacitance to the gate voltage over the p* 
region. For instance if pao represents the discharge path for a particular size 
of h, then pa'o would represent the effect of increasing the size of h in the 
direction just mentioned (all other quantities such as rate of gate-voltage change, 
etc. being otherwise alike). 

Although the foregoing considerations are qualitative ones they indicate cer- 
tain important features which enable us to establish a simple model for the 
discharge of a p* region. A sketch of the form of an actual measured discharge 
peak for an arbitrary p* region is shown in the top right-hand corner of fig. 14a. 
The point p indicates the beginning of the discharge. The time from the be- 
ginning of the discharge to the peak value is indicated by t’. It represents the 
major portion of the duration of the pulse. The maximum of the discharge is 
indicated by Imax. We will use these two quantities to characterize a discharge 
pulse. The qualitative considerations with respect to the IGFET characteristics 
indicated that up to the peak value of the discharge current the IGFET operated 
in saturation. The peak of the discharge is reached when the IGFET enters the 
linear part of its characteristics. Simple IGFET theory indicates that this occurs 
when the potential of the drain (region h) equals the gate voltage V, less the 
threshold voltage Vyro. The discharge begins when LA > |o]. The time 
which elapses from the beginning of the discharge until the peak value is t’. 
The maximum discharge current can then be obtained from the expression for 
current flow by inserting the value of V, at this peak value. 

The elements of the simple IGFET theory which we used to quantitatively 
describe the discharge of p* regions are as follows. 

The current flow to a p* region while the IGFET operates in its saturation 
region is 

T+ = (B/D) [V (0) — Vro, — B = u Ca slk; (2.4) 


C, is the oxide capacitance per unit area over the source-to-drain region 
(between g and h, fig. 14); 

- Fro is the threshold voltage (grounded source), 

V(t) is the gate voltage midway between two adjacent p* regions (g and h, 
fig. 14) between which 7,. flows. The time dependence of V, is deter- 
mined by that of the triangular wave V,, [3] to [4] in fig. 13. 

The time-dependent voltage of the p* region (h in fig. 14), whose discharge 

is being considered, with respect to the n substrate is l 


Ie dt 
V+) = [7——-— v0. (2.5) 


o 
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This is valid for the reasonable assumption that 
Co > Cp+n; 


Co is a capacitance determined by the thickness of the oxide between the 
gate and the p* region whose discharge is being considered, and by the 
area of the gate which overlaps that p* region; 

Cp+n is the junction capacitance of the p* region with respect to the n bulk; 
its value depends on the reverse bias across the junction: independent 
of the value of this bias we assume that Cy > C,«,; 

V,+(t) denotes the time-dependent reverse bias on the p* region in the event 
that no holes were allowed to flow to the p* region. Electrostatic con- 
siderations enable this to be calculated in terms of the positive peak 
voltage reached by the gate over the p* region, and the value of the 
gate voltage at the moment of discharge. 

The time t’ defined earlier, see fig. 14a, is defined as the value of ? when 
V',«(t' becomes equal to V,(t') — Vo. The zero of t is defined when 
V,(0) = Vro. 

The justification for the use of the simple IGFET model is found in the rather 
good agreement between calculated and experimental values shown in table II. 
This table shows the results of a comparison between theory and experiment as 
regards the salient features of a discharge pulse for two arbitrary p* regions 


TABLE J 


Comparison between theory and experiment for the output pulse of a resistive- 
insulated-gate array of the type shown in fig. 13. The pulses corresponding to 
two different positions (values of x) along the gate are compared. The array 
had 32 p* regions, voltage gradient along the gate was 32 V/cm, line scan 
time 2:5 ms, peak-to-peak value of the triangular driving voltage (V,, fig. 13) 
was 32 V. The p* regions under the gate were 100x100 um?: they were 
separated by 20 um from the common read-out region. k (see fig. 14) was 
30 um. Minimum useful line scan time 0-25 ms 


6°5 . 1075 
6:65 . 1075 


experiment 
theory 


experiment 
theory 
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in an array. The array considered and the conditions corresponding to the 
results are also indicated in the legend to the table, as is the calculated minimum 
scan time for the array. This quantity which agreed quite well with experiment 
was estimated as follows. From the rate of change of the triangular voltage V, 
we calculate r, the interval between the addressing of two adjacent p* regions. 
Using the simple IGFET model we calculate t’ (see fig. 14a). For proper opera- 
tion of the array, the interval t must be greater than t’. We choose the fre- 
quency of V, so that this is the case. The line scan time can then be deduced. 


2.4.4. Undesirable inversion-layer capacitive signal 


Figure 15 shows a copy of an oscilloscope trace of the output from a resistive- 
gated array of the type illustrated in fig. 13. The device had 32 p* regions under 
the resistive gate. Noticeable in fig. 15 is the unequal pulse height, which we 
consider later, and also that the output pulses are superimposed on a signal 
which increases linearly with the length of the p-type inversion layer. This signal 
is due to a capacitive current and is caused by the flow of holes needed to supply 
the growing inversion layer. This current has been considered extensively else- 
where 7). We suffice here by pointing out that it can be eliminated from the 
output as indicated in fig. 12 by using a second gate of the same dimensions 
but with no p* regions under it, and only the difference signal from the two 
arrays is considered. Alternatively a high-pass filter can be used to eliminate 
the inversion-layer-movement current from the output of a single resistive-gated 
array. 


Current 


——»- Time 


Fig. 15. Copy of oscilloscope trace of the output obtained from a resistive-gated array of 
the type shown in fig. 13. The array had 32 p* regions; relevant dimensions are given in 
fig. 14a. The vertical scale is 2 wA/div., horizontal is 1 ms/div. 


2.4.5. Unequal pulse-height elimination 


It was pointed out in sec. 2.4.2 that the amount of charge which flows to 
each p* region under the resistive gate as each is sequentially discharged is not 
alike for each region. One solution we used for this problem is as follows. At 
the expense of an extra technological step, we made arrays with p* regions 
under the gate, which regions had a lowered doping level near the silicon- 
oxide-silicon interface. This meant that where the gate voltage became more 
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positive than some moderately large value V;,, p+ inversion occurred at the 
surface of the p* regions under the resistive gate. Thus where the gate voltage 
exceeds Viny p+ the negative charge at the surface of the p* regions will be 
due (a) as before to acceptor atoms exposed because holes have been injected 
into the n bulk, and (b) to negative charge due to an inversion layer of elec- 
trons which can flow easily to and from the surrounding n bulk. We refer to 
fig. 13 and consider a p* region at position x. As the triangular driving volt- 
age V, decreases from its maximum value, and the voltage at x drops from ea 
to eb, negative charge in the inversion layer at the surface of the p* region there 
will flow back to the surrounding n bulk. But once the voltage becomes less 
than eb, then in the same manner as described in sec. 2.4.2 the p* region will 
become reverse-biased with respect to the surrounding n bulk. When the gate 
voltage at x becomes equal to ec (Vro) the p* region will be connected by an 
inversion layer of holes to the read-out region and be discharged. The amount 
of charge which flows will be proportional to the voltage swing bc. This de- 
pends on the threshold voltage Vro and on the value of the gate voltage at 
which the p* region begins to become reverse-biased. As we can see by referring 
to fig. 13 the voltage swing bc is now independent of x and alike for all the 
p* regions under the gate. This means a chain of pulses of equal magnitude 
will be obtained across the resistance R as the row of p* regions is discharged. 
Figure 16 shows the output pulse chain from a device of the foregoing type. 
As can be seen, the output pulses are of equal height and shape. The array used 
has the same dimensions as the one which yielded the results in fig. 15 
(Vis, p+ = 14 V, p* regions 100 100 ym’, separated by 30 ym from each 
other and by 20 um from the read-out region). 

Another method we used to obtain equal output pulses is by means of a 
double-gate structure. One, the charging gate, practically covers the p* regions. 
The resistive gate now only overlaps one edge of each of the p* regions of the 
array and one edge of the read-out region. The array of p* regions is simultane- 
ously charged by momentarily making the charging gate positive. The resistive 
gate is then used to sequentially discharge the p* regions. The foregoing struc- 
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Fig. 16. Copy of oscilloscope trace of the output obtained from a resistive-gated array similar 
in geometry to that shown in fig. 13, but with p* regions of lowered surface concentration. 
The array had 32 p* regions; relevant dimensions in um are given in fig. 14a. The vertical 
scale is | wA/div., horizontal | ms/div. 
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ture has the disadvantage of needing a separate signal to operate the charging 
gate. We later return to this double-gate structure in connection with other 
points of discussion. 


2.4.6. Reduction of dissipation in resistive gate 


A double-gate structure similar to the one just mentioned and illustrated in 
fig. 17 enables the dissipation in the resistive gate caused by the voltage gradient 


I Threshold 
voltage 


—— Distance 


Fig. 17. Double-gated array, the p* regions are insulated from the gates by an oxide layer. 


across it to be reduced. The gate G, which overlaps the edges of the read-out 
and the row of the p* regions is biased and operated as before. It is responsible 
for allowing the p* regions to be sequentially discharged. A voltage gradient 
is only applied across this gate. Because it is much narrower than the gate 
shown in fig. 13, the total dissipation across it, due to the current responsible 
for the voltage gradient, will of course be smaller than for the gate of fig. 13. 
The gate G; has novoltage gradient applied across it. One of its ends is, however, 
connected to the driving voltage as shown. The potential of this gate with re- 
spect to the underlying silicon for two values of the driving voltage is indicated 
in fig. 17 (see curves [1] and [2]). It can be reasoned as before that this gate G; 
will cause the p* regions in the row to become reverse-biased, so that an output 
pulse train will result when the row of p* regions is sequentially connected to 
the read-out region by the inversion layer under gate G,. The operation of the 
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array is similar to the ones already considered, except that the dissipation 
needed to establish a voltage gradient across the resistive gate is confined to 
a much smaller gate electrode. 


2.4.7. Voltage distribution along resistive gate 


In the structure of the type in fig. 13 the voltage gradient along the resistive 
gate is also dropped across the width of each pt region. This means that the 
inversion layer which connects the pt regions to the read-out region will not 
occur simultaneously across the width of a p* region. This results in a more 
gradual discharge of the regions. The situation can be improved as indicated 
in fig. 18a by more heavily doping those parts of the resistive gate over each 
p* region and the read-out region. This approach is not successful when high 
scan rates are desired because then it becomes necessary to dope the whole of 
the resistive gate to a relatively high level. The device geometry shown in fig. 185 
is preferable to the foregoing as it results in a better distribution of the voltage 
across the resistive gate between the p* regions of the row. We return to this 
structure later in connection with light-pattern scanners. 
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Fig. 18. (a) Resistive gate with heavily doped regions to improve the distribution of the 
voltage along the gate. 

(b) Resistive-gate array which offers a more desirable distribution of voltage along the resistive 
gate than the foregoing one. 


2.4.8. Driving-voltage waveform 


In the foregoing considerations we have used a triangular voltage wave for 
driving the gate (voltage between gate and substrate). If we refer to fig. 13 it 
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Fig. 19. Various types of driving voltages which can be applied between the resistive gate and 
the substrate, see e.g. fig. 13. 


can be seen that the scanning of the array (the time during which the p* regions 
are being sequentially connected to the read-out region) occurs in only a part 
of the period of the triangular wave, namely from [3] to [4]. During the part 
of the period from [1] to [3] the p* regions become reverse-biased. Since holes 
can be rapidly pumped out of the pt regions into the n bulk, a more efficient 
waveform for driving the array would be that shown in fig. 19b. For comparison 
the triangular wave is shown in fig. 19a. The same time scales and voltage scales 
are assumed. A still more efficient waveform is that shown in fig. 19c. 


2.4.9. Maximum scan rate and device parameters 


From a practical point of view it is interesting to consider the maximum scan 
rate of arrays of the foregoing type. A number of factors play a role. One of 
the most obvious limitations is the resistive gate and the underlying-oxide 
capacitance, which form a distributed RC network. We have derived else- 
where 7) an upper frequency limit, 

53 
L? 05 Cox 


where Z is the length of the resistive gate, o, the sheet resistivity of the poly- 
gate, and C,, the capacitance per unit area of the oxide under the gate. For a 
gate length of 5 mm an oxide thickness of 0-1 um and o, = 20 O/[], 


f — 28.106 Hz. (2.7) 


(2.6) 
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There is another scan-rate-limiting factor which in practice will dominate 
the foregoing one. This limit is imposed by the time needed to discharge the 
regions under the resistive gate. The discharge time can be reduced by using 
n-channel arrays, and improved geometries indicated in sec. 2.4.7. A further 
improvement can be obtained by reduction of the distance over which current 
must flow to discharge the charged regions. With standard technology a lower 
limit is 5 um. By use of say D-IGFET technology this can be reduced to 1 um 
or less. 

With the aid of the simple model presented in sec. 2.4.3 and relationships 
derived in appendix A, we have calculated the relationship between discharge 
time and voltage gradient along the gate electrode for different device geometries. 
The calculations are based on the criterion that a region must be fully discharged 
before the discharge of an adjacent one may begin. The results are plotted in 
fig. 20. The results correspond to a double-gated structure illustrated in fig. A.1. 
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Fig. 20. Calculations based on the theory of appendix A. The relevant device dimensions are 
shown in fig. A.1. Shown is the relationship between the voltage difference (;n t’) between 
adjacent p* regions along a resistive gate, and the time needed to discharge a p* region, 
before the discharge of an adjacent region begins. Unless otherwise stated the total voltage 
swing applied to the charging gate G2 was 10 V. The oxide thickness is 0:1 um. 


We assumed an 7 channel device (i.e. n* regions in a p substrate). One of the 
two gates is used to reverse-bias the m* regions, the other gate along which the 
voltage gradient is applied is used to discharge the regions. As explained in 
appendix A, although the double-gate system simplifies the calculations to some 
extent, the results can be adapted to a single-gate structure. The quantity mt' 
represents the voltage difference along the resistive gate between adjacent 
n* regions. 

It can be seen from fig. 20 that as the voltage gradient along the gate increases, 
the discharge time decreases for a given device geometry. Increase of the voltage 
gradient allows more current to flow to an n* region before the discharge of 
an adjacent region begins. The smaller a region (i.e. the smaller 7’) the less 
charge needed to discharge the region and hence the less time needed. The 
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smaller /, the larger the current for a given gate voltage and again the shorter 
the time for discharge. ; 

The curves of fig. 20 can be used as follows. Let us consider say 100 elements 
in an array, with regions of dimensions /'/ = 20 um?. We assume a voltage 
swing on the charging gate of 10 V, and 0-3 V between each element along the 
resistive gate; this corresponds to a total voltage across the resistive gate of 
30 V. We now estimate the time needed to scan the 100 elements, so that no 
overlap occurs between the discharge peaks of adjacent regions. Using curve 
I'l = 20 wm? and mt’ = 0:3 we obtain t’ = 1077 s as the time to discharge one 
n* region. Hence the time to scan 100 regions is 100x 10-7 = 10-5 s. Con- 
versely, if the total allowable scan time is known, we can estimate the voltage 
gradient needed along the resistive gate in order to avoid interference between 
the discharge currents for adjacent regions. 

The maximum voltage which can be applied along a resistive gate and hence 
the minimum scan rate for a given device geometry and number 7 of elements 
in an array will of course be limited by the breakdown voltage of the oxide 
under the resistive gate: we obtain 


nmt") < E, dox (2.8) 


where E, is the breakdown field for the oxide, and dax is the oxide thickness 
under the resistive gate. If we extract the limit value for mt' from (2.8), then 
curves similar to those of fig. 20 will yield the scan speed for a corresponding 
geometry and voltage swing on the charging gate. 


3. Resistive-insulated-gate light-pattern scanners 


3.1. Photo-current mode of operation 


A very simple type of resistive-insulated-gate light scanner is shown in fig. 21. 
This device has been reported on in detail elsewhere 7). We suffice here with a 
brief explanation of its principle of operation and indicate the more important 
salient features. For definiteness we assume a p*7 diode. Because of the voltage 
gradient due to V, along the transparent resistive electrode (polysilicon, thick- 
ness 0-2 um), an inversion layer will advance or retreat along the surface as V, 
goes respectively negative and positive. Assume a light pattern is incident on 
the gate electrode and that the rate of change of the light pattern is slow com- 
pared to the scanning rate of the inversion layer. The part of the pattern which 
falls on the inversion layer causes a photo-current to flow through the resistor R, 
which connects the p+ region to the n bulk. This photo-current is proportional 
to the total amount of light incident on the inversion layer. Hence if the inver- 
sion layer is made to vary in length with time, the differentiated output signal 
across R will be proportional to the light intensity at the extremity of the ad- 
vancing or retreating inversion layer. 
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Fig. 21. Plan view of a resistive-insulated-gate structure, and cross-section along aa’. Also 
shown is the voltage along the gate for different values of Vi. 


The simple device just described suffers from a serious drawback: a capacitive 
current also flows into the resistor R. This is due to charge flow to and from the 
growing or contracting inversion layer. The current increases with increasing 
scan frequency and even at relatively low frequencies is able to swamp the photo 
signal. As we explained before 7), this undesirable capacitive signal can be 
eliminated from the output by using the difference signal from two resistive 
gates. Light is projected only on one array. Such a technique extends the useful 
scan frequency up to hundreds of Hz. Some results for such a twin-gate arrange- 
ment are shown in fig. 22. 

Because the foregoing device operates in a photo-current mode it is inherently 
insensitive. In the following sections we describe devices which are more sen- 
sitive because they operate in an integration mode. 


3.2. Integration-mode arrays 


3.2.1. Single-gate structures 


The type of array considered in sec. 2.4, fig. 13, namely one with a row of 
p* regions under a resistive-gate electrode but insulated from it by an oxide 
layer, a common read-out region, can also be used to scan a light pattern and 
operate in an integration mode. Consider fig. 13; as explained earlier, as the 
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Fig. 22. Differentiated output of a twin-resistive-gate circuit for 9 light spots projected onto 
one of the gates. The voltage gradient along the gates of 42 and 90 V/cm corresponds to 
trace a) and 5) respectively. The growth and decay times for the driving voltage applied be- 
tween the gate and the substrate (see fig. 21) were 2-5 and 25 ms, respectively. 


voltage on the gate is increasing, holes are injected from the p* regions under 
the gate and are lost by recombination with electrons in the n bulk. When the 
voltage V, begins to decrease from its maximum value (condition [2] in fig. 13) 
the p* regions begin to demand holes, and become reverse-biased with respect 
to the surrounding n bulk. We ignore holes due to thermal generation. The 
demand for holes is not satisfied until the gate voltage equals the threshold 
value Vro (see fig. 13) or unless light is incident on the array. The quantity ac, 
(see fig. 13) is proportional to the time (integration period) during which the 
region F demands holes and is thus sensitive to light. Figure 23 shows the 
influence of an arbitrary light pattern on the output trace of a 32-element p” 
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Fig. 23. Output signal from a resistive-gated array of fig. 13 with light incident on parts of 
the gate. 
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array similar to the one considered in sec. 2.4 and which yielded fig. 15 in the 
absence of light. As can be seen, by comparing figs 15 and 23, the incident light 
causes a decrease of the magnitude of the output pulses. This contrasts with 
what is normally obtained from a light scanner for which the output pulses are 
proportional to the integrated incident-light intensity. 

The foregoing device suffers from two drawbacks. As explained earlier the 
output pulses for the p* regions are not alike, and neither for the same reasons 
are the integration periods. For instance ac in fig. 13 varies with x. One method 
we successfully used to overcome these drawbacks was already presented in 
sec. 2.4 in connection with the unequal output pulses. As we explained, the sur- 
face doping level of the p* regions under the resistive gate is lowered sufficiently 
to allow an n-type inversion layer to be induced there when the gate voltage 
exceeds a relatively low value V,, ,*. This value is indicated in fig. 13. When 
V, now decreases from its maximum value (curve [2]), electrons in the inversion 
layer at the surface of the p* regions will flow back to the n bulk. Only when 
the gate voltage over a p* region becomes lower than Viny ,* does that region 
demand holes and become sensitive (integration period begins) for light. Upon 
referring to fig. 13 it can be seen that the voltage difference bc which is also 
proportional to the integration period is independent of x. An output trace 
obtained from such an array is shown already in fig. 16. 

The various factors which govern the speed of operation of the foregoing 
arrays and the shape of the output pulses together with expressions relating 
the various quantities have already been considered in sec. 2.4. 


3.22. Uncoupling of integration time from scan time 


In both of the foregoing devices, the integration time (the period during which 
a p* region is sensitive to light) is coupled to the rate of change of the driving 
voltage V,. The total integration time for all the p* regions is related to the 
time it takes the voltage V, (see fig. 13) to change from value [2] to value [4]. 
In the remainder of this section we consider devices in which the integration 
time is uncoupled from the scanning time. 

Consider the device shown in fig. 24. The voltage gradient along the gate 
allows as before each p* region in the row to be sequentially connected to the 
read-out region when the driving voltage V, becomes sufficiently negative. As 
each p* region is connected to the read-out region it becomes charged to the 
potential V,. After scanning the row, the voltage V, decreases quickly to zero 
and the row of p* regions are left reverse-biased and isolated from the read- 
out region. If light is allowed to fall on the row (for a period assumed long 
compared with the time needed to scan the p* regions) the p* regions will be 
discharged somewhat. Now when the row is next scanned an amount of charge 
will flow to each p* region: this amount is proportional to the intensity of the 
incident light and the integration period. The output chain of pulses which 
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Fig. 24. Resistive-gated-array light scanner. The common p* region serves as charging and 
read-out region. 


appears across the resistance should thus be related to the variation of the 
intensity of the light along the row of p* regions. The foregoing mode of 
operation was proposed by Kabell !). Although very simple, in practice it suffers 
from a drawback. In the foregoing it was tacitly assumed that the voltage gra- 
dient along the gate is the sole determinant for the order in which the p* regions 
are discharged. It was assumed that once the gate voltage exceeded the thresh- 
old value near a p* region, this region would be connected to the output. 
The threshold was assumed alike for the whole resistive gate. Unfortunately 
this is only the case when the potential of all p* regions is alike: this cannot 
be the case once light of varying intensity is allowed to fall on the array. The 
variation of potential of the p* regions gives a corresponding variation in thresh- 
old voltage with a possibility of non-sequential scanning and in any case a 
time separation between the output pulses which is no longer proportional to 
the spatial variation of the light intensity along the array. 

We will now present results for an array which is operated in ways which 
avoid the drawbacks of the preceding devices, namely integration time coupled 
to scan time, non-sequential scanning. The array we consider has been designed 
to allow a better-defined current-flow path to the p* regions from the read-out 
region such that a more rapid discharge occurs. This array also represents a 
feasibility study of the resistive-gate scanning technique for relatively long 
arrays (100 or more elements in an array). 

The array we consider is n-channel and is shown in fig. 25. We operate this 
array in two ways, which we consider in turn. With switch S in position 1, the 
n* regions are reverse-biased when connected to the read-out region by an 
inversion layer, which is forced to grow under the gate by a signal (increasing 
linearly with time) between the gate and the substrate. The inversion layer is 
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Fig. 25. Plan view and cross-section of part of a 100-element resistive-gate line scanner. The 
dimensions are in pm. 


next allowed to decay; switch S is set to position 2 and integration begins; 
next, with S still in position 2, the n* regions are scanned by applying the same 
signal as before to the gate. The n* regions are now sequentially discharged. 
The greater the light intensity the smaller the output pulse for an n* region. 
The influence of an arbitrary light pattern on the output is shown in fig. 26b 
(compare with 26a: no light). The results were obtained for values of R = 10* 9, 
V, —13 V, V, —3 V. This latter voltage was necessary because the thresh- 
old voltage for these particular samples was slightly negative (~ 200 mV). 
This resulted in a weak inversion layer at the SiO;-Si interface which pre- 
vented the n+ regions in the row being isolated from the read-out regions. 
Use of V, cut off this conducting path and ensured that the light-sensitive row 
of n* regions could only be connected to the output only via inversion layers 
induced under the resistive-gate fingers. 

Figure 27 shows a complete output trace of which only a portion is shown 
in fig. 26. As can be seen, the output pulses are superimposed on a triangular 
signal. This is due to the growth of the inversion layer under the gate and its 
elimination has been mentioned already. 

Figure 28 shows the end of the trace of which the beginning is shown in 
fig. 26a and b, and fig. 29 shows, on a more expanded scale, a portion of the 
beginning of the same trace. The foregoing results were all obtained with 
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Fig. 26. (a) Part of an output signal which appears across R in the absence of light for the 
arrangement in fig. 25. 
(5) Influence of an arbitrary light pattern on the output trace. 
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Fig. 27. Complete output trace for a 100-element resistive-gated array, of the type shown in 
fig. 25. The voltage along the gate was 24 V. 
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Fig. 28. Part of the output signal which appears across R for the device of fig. 25. This is 
for the last n* regions to be discharged while fig. 26a shows the trace on the same scale for 
the first n* regions of the array which are addressed. 


472 M. V. WHELAN, L. A. DAVERVELD and J. G. de GROOT 


Current(A] 


| 


e] — Time 
3105s 


Fig. 29. Part of the output trace shown in fig. 26a but with an expanded time scale. 
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Fig. 30. Part of the output trace for the resistive array shown in fig. 25; the voltage along 
the gate has been lowered to 12 V compared to 24 V for figs 26 to 29. 


24 volts applied across the resistive gate. Figure 30 shows the effect of lowering 
the voltage to only 120 mV between each n* region. Even for this relatively low 
voltage gradient, distinct peaks can still be obtained. 

The danger of non-sequential scanning has been avoided in the foregoing 
array by ensuring that during scanning the n* read-out regions are at the same 
potential and act effectively as sources by being at a lower potential than the 
light-detecting n* regions. These latter now correspond to floating drains of 
IGFET transistors. 

The second mode *) of operation we successfully used was as follows. An 
auxiliary gate electrode is allowed to cover (but is insulated by an oxide from 
them) the light-detecting n* regions. This gate is momentarily made negative. 
The integration period then begins and subsequently the n* regions are sequen- 
tially scanned as before. The read-out region is now continually connected 
to earth via R (or to some potential V, for the reason mentioned earlier). 

For a discussion of factors such as maximum scan speed and its dependence 
on geometry and voltage gradient along the resistive gate one is referred to 
sec. 2.4 and its associated appendix A. A more detailed analysis of the fore- 
going device will be published later. 


*) This method yields a device with a larger dynamic range than the foregoing one, because 
charge can now also be stored in the relatively large oxide capacitance between the gate 
and the p* regions. 
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3.3. Conclusions 


A number of light-pattern scanners have been considered in this section, 
starting with a photo-current mode device. Although simple to make and 
operate it lacked sensitivity. We then considered a device capable of integration- 
mode operation (single resistive insulated gate with p*(n*) regions under the 
gate). This had the drawbacks of unequal integration times for the elements of 
the array, also unequal output signals for the same elements and the integration 
time was coupled to the scan time. An array made at the expense of an extra 
technological step was considered next. It had p* regions with lowered surface 
doping. This device still retained, however, the last of the above drawbacks. 
Arrays were considered next in which the resistive gate no longer had the dual 
function of charging the p* regions and subsequently allowing them to be 
scanned. One type had only a resistive gate which allowed the p* regions to be 
charged by being connected to a read-out region and then allowing them to be 
subsequently scanned by being connected to the same read-out region but which 
was now at a lower potential than when it was used to charge the p* regions. 
The other type of array was a twin-gate set-up. In this arrangement the read-out 
region was held continually at a fixed potential (preferentially zero). The second 
gate which overlapped the row of p* regions was used to charge them. The 
resistive gate enabled the row to be scanned by connecting the p* regions 
sequentially to the read-out region. Of the arrays considered, the latter two 
types offer the most promise from the point of view of ease of manufacture and 
operation. 

The arrays considered were all line-type scanners, and we believe that resistive- 
gate arrays are quite attractive as line scanners, they are relatively simple to 
make using standard silicon-gate technology and normal tolerances; they offer 
quite high packing densities, since each element (p* or n*) in a row is active 
in detecting incident light. We have demonstrated the feasibility of relatively 
long arrays up to 100 elements (fig. 31). Finally the arrays are simple to operate 
requiring no complicated clocking system. One practical objection to the 
resistive-gate scanners could be the voltage needed across the gate. This could 
become objectionably large if resolution is to be retained at relatively high 
scanning speeds. This problem and solutions to it will be considered in a later 
publication. 

From the point of view of two-dimensional scanners, the resistive-gate tech- 
nique is not so attractive especially when compared with other techniques 8). 
This conclusion is based on some preliminary experiments with a two-dimen- 
sional array of 20 resistive gates with 32 p* elements per gate. The output of 
each line was selected by means of a double-resistive-gate arrangement (over- 
lapping inversion layers) similar to that considered in sec. 2.2. Results on this 
arrangement will be published at a later date. 
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Fig. 31. Photograph of one end of the 100-element resistive-gate array. 


4. General conclusions 


Although more or less exploratory in nature, this paper — we hope — has 
illustrated some of the possibilities opened up by resistive-insulated-gate tech- 
nique, e.g. steering circuits for LED indicators, analog-digital conversion and 
light-pattern scanning. Despite the fact that the designs presented were not op- 
timised, the results obtained together with the aid of expressions derived, indi- 
cate exciting new possibilities for resistive-insulated-gate arrays. This applies 
particularly in the area of analog switches and A/D conversion, especially when 
simplicity of operation and design, and conversion speed are important. With 
present techniques 6 to 7 bits and conversion speeds of less than | ps are 
feasible, although in the devices presented the lack of sensitivity (relatively large 
voltages needed across the resistive gates) and the use of floating supplies are 
drawbacks. In later work we will dwell more deeply on these problems and 
show how they can be alleviated. 
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Appendix A. Relationship between scan speed, voltage gradient, and geometry 
for a resistive-insulated-gate array 


In this appendix expressions are derived, which relate device geometry, scan 
speed, and voltage gradient along the resistive insulated gate for devices similar 
to those considered in secs 2.4 and 3.2. The device geometry considered is 
shown in fig. A.1. This is a double-gated device. Gate G, is used simply for 
scanning, i.e. for sequentially connecting the p* regions to the read-out region. 
Gate G, is used to charge the pt regions. It overlaps these regions but is 
insulated from them. By momentarily making this gate positive holes are ejected 
from the p* regions into the surrounding z bulk where they are lost by recom- 
bination. When the voltage is reduced again to zero or even a negative value, 
holes will be demanded by the p* regions. If thermal generation of p-n pairs 
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Fig. A.1. Plan view and cross-section of part of a resistive-gated array. The resistive gate Gy 
is used only for read-out purpose, the other gate G3 is used for charging the row of p* regions. 


can be neglected then the p* regions become negatively charged with respect 
to the surrounding n material. The p* regions can become discharged by being 
connected to the read-out region by an inversion layer which can be made to 
grow under gate G,. 

The device geometry shown in fig. A.2 consists of only one gate and is a 
similar type of array as considered in sec. 2.4. The gate fulfils the dual function 
of charging the p* regions and subsequently allowing them to be sequentially 
discharged. An analysis of this array would appear to be more complicated 
than that of fig. A.1. Because of the voltage gradient along the gate, each p* 
region of the row “sees” a different maximum positive voltage before being 
discharged. This, as explained in sec. 2.4.2, means a different amount of charge 
has to flow to each p* region in order to discharge it. A further complication 
would appear to be caused by the fact that the gate voltage is changing during 
the discharge of a p* region, which means that the amount of holes which must 


n-Si 


Read-out contact Resistive gate 
Si 05 
p* 
a Li 


Fig. A.2. Plan view and cross-section of part of a resistive-gated array. The gate fulfils the 
dual function of charging the row of p* regions and subsequently allowing them to be scanned. 
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flow to a p* region also changes during discharge. Despite the foregoing com- 
plications we can apply the expressions for the device of fig. A.1 to the device 
of fig. A.2. We can assume (supported by experiment) that once the discharge 
of a p* region begins, the time of discharge is so short that we can assume that 
the voltage on the gate over a p* region changes by an amount which is negligible 
compared to the preceding voltage changes on the gate. We thus assume that 
the total voltage excursion which is responsible for charge flow to a p* region 
once it is connected to the read-out region is V7 + Vro, where V i is the 
largest positive voltage which appeared over the particular p* region being 
considered, since it was last discharged. The threshold voltage is denoted by 
Vro. For p-channel arrays if Vro < 0 then the two voltages add. The voltage 
Vt + Vro is substituted into the expressions for the double-gate system as 
being the total voltage excursion responsible for charge flow to the p* region. 
For the device of fig. A.2, the voltage excursion is different for each p* region 
because of the voltage gradient but it can easily be estimated. The discharge for 
the whole array will be the sum of the times for each p* region. 

Expressions will now be derived for the structure of fig. A.1. The gate G, 
which charges the underlying p* regions, does so by first being raised to a 
potential Vp max and then being held either at zero or a negative potential V. 


When the potential of G, is raised to Vizma then 
Osi m Q 
mar 2 (A.1) 
Cox Cox 


The charge at the silicon surface is Qs; m. We assume that because the p* region 
is rather heavily doped that the voltage drop across the silicon-surface space- 
charge region under G; can be ignored compared to the voltage drop across 
the oxide. For simplicity we also assume that G, completely covers each 
p* region. Qox is the effective charge located at the SiO;-Si interface and it 
represents oxide charge and that in surface states. 

When F,, is reduced to some value V,, .., and held constant, the p* regions 
under G; become reverse-biased. We ignore the influence of leakage currents. 
Before discharging of the p* regions occurs we can write 


Osi m^ Q Qo 
LR spur m + V, va C , (A.2) 


where Q, represents the charge of exposed acceptors in the p* region at the 
pn junction. This is due to holes which have moved up to the SiO;-Si inter- 
face when V,, was reduced. This redistribution of charge in the pt region 
causes a reduction of the silicon-surface space charge and at the same time a 
reverse-biasing of the p* region with respect to the n substrate. 
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After V,, has been reduced to V, and inversion has been created under 


2min 


gate G, so that a certain amount of charge has flown to a p* region then 


t 
Osim + S H(t')/ WI] dt'— Q, 
issu a a ++ V, ame Qox Cox; (A.3) 
Cox 
I(t) is the current flow of holes to a p* region via the inversion layer. The area 
of the region is WI’. We define t = 0 as the moment at which current flow 
to a p* region begins, namely when the gate voltage of G, becomes Vro at 
the p* region in question. 
For the voltage V, between the p* region and the n substrate we write 


q Na 


E [y?(t) — 907], (A.4) 


Esi 

where y(t) is the width of the space-charge region between the p* region and 
the n bulk. We assume for simplicity an abrupt junction; yo is the width of the 
space-charge region corresponding to the diffusion potential for the junction. 
The doping level of the n substrate is denoted by N4. The dielectric constant 
of the silicon is denoted by eg. We can write 


Q, — —4 Na D(t) — yo]. (A.5) 
Using eqs (A.5) and (A.4) we obtain 
qN Q Q 
eel (- 2 ) (- E 2o) (A6) 
2 Esi q Na q Na 
Equation (A.1) gives 
Osi m — —Cx Vomax T Qox: (A.7) 


Substituting eqs (A.6) and (A.7) into (A.3) gives 


-y Qox 1 dt’ Q, 4 Q, (- Q, 
£2min 82max Cox Uw Cox Cox 2és; 


0 


Upon simplifying we get 


1 
0,8 204 q Na 


S2max 


+ Vermin + | ne )- 0 (A9 


+26 v (7r 
sid iva JE WO 
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Solving this for Q, and using the knowledge that Q, must be zero if 
V, —, = 0 and I(t) = 0, we obtain 


82max 82min 


1 y 
2, = 29 4Na(z £M. 


ox esi 


l yy * I(t) dt 
— [54 Na e ee —2 es1 g Naf —Vaz max F Vermin F | wc) 


(A.10) 


If we know how the current J(7) varies with time, Q, can be calculated and in 
turn with the aid of eq. (A.6) so also can the junction voltage. 

As explained in sec. 2.4.3 the discharge current to a p* region increases to 
a maximum value and then decreases rapidly to zero. The time to reach the 
peak value represents the major portion of the discharge period so that an 
estimate of this value gives a good idea of the discharge time. We also pointed 
out in sec. 2.4.3 that, up to the peak of the discharge, the current flow to a 
p* region is that of an IGFET in saturation. The channel length is /, and the 
width W. So the current up to the peak value is 


Kt) — £ [V,,(t) — Vro]? (A.11) 


where f = u Cox WII. 

The end of the saturation mode (i.e. at time ¢ = t’) is reached when the drain 
potential, in this case the junction voltage V, between the p* region and the 
n bulk, is such that 


V(t’) = V,,(t') — Fro. (A.12) 
If we assume that the potential of the resistive gate G, changes linearly with 
time and at a rate of m volts/unit time, and if t = 0 corresponds to the be- 


ginning of the discharge of a pt region, i.e. the moment at which the voltage 
on G; is Vro near the pt region in question, then we can write 


I(t) = ^ (m t). (A.13) 


Substitution of this into eq. (A.10) and simplifying gives 
Esi q Na y m? t? 


+ 
» (A14 
311’ ) : ) 


Q, —0— G $ 2 Esi 7 Na (Vi, ax A Veomin) m 


1 Jo 
X =E N| — + — |}. 
sid AC: ) 


ox Esi 
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Also we can write eq. (A.12) as 
Vt) mt. . (A.15) 


The discharge time for a single p+ region is roughly the time needed to reach 
the peak in the discharge current. Hence for a double-gate system the total 
scan or discharge time for an array of 7 p* regions is y t’. 

Equation (A.14) allows Q, to be calculated versus time. Then with Q, and 
eq. (A.6), the voltage V,(t) can be estimated. The time ¢’ is obtained when 
eq. (A.15) is satisfied. 

The voltage mt’ is the amount that the gate voltage on gate G,, at the 
pt region in question, is in excess of its threshold value Vro, when that pt 
region is practically discharged. Now if we are to avoid interference between 
the discharge currents for adjacent p* regions, then the gate voltage on G, 
opposite the adjacent but yet undischarged pt region may not be larger than 
the threshold value Vro. Hence the voltage drop along the resistive gate be- 
tween adjacent pt regions of the row must be at least equal to mt’. Hence 
for an array of 7 elements the total voltage along the resistive gate must be 
nmt. 

With the aid of the foregoing expressions some curves shown in fig. 20 were 
calculated which give some idea of the relationship between scan time, voltage 
gradient and device dimensions. The criterion of non-interference between dis- 
charge currents for adjacent elements was used, i.e. minimum voltage drop 
along the resistive gate between adjacent elements of m t’. For simplicity the 
calculations were done for a double-gate array. The total voltage swing 
Veomax — Vezmin ON gate Gz is indicated. (Earlier we explained how to tackle 
a single-gate device.) The other quantities used in the calculations are indi- 
cated. In order to obtain some idea of upper limits the calculations were per- 
formed for a mobility of 800 cm?/V s, i.e. n-channel arrays were assumed. 


Appendix B. Charging time of a ROM line capacitance by a double-resistive- 
insulated-gate arrangement 


We derive here the time £, needed to charge a ROM line of the arrangement 
of fig. 11 to a certain fraction (1 > « > 0) of its final voltage V,m. The situa- 
tion we consider is illustrated in fig. 11, which shows the sampling gate turned 
off, i.e. it is grounded. The problem is to estimate how quickly the line Grom 
can be charged via the overlapping inversion layers to some limiting potential. 
The problem can be reduced to that represented by the simple équivalent circuit 
in fig.11. The IGFET M,, of effective channel length 2Z, represents the over- 
lap of the inversion layers under the resistive gates at that point; M, represents 
an IGFET from the sampling array, and is not conducting. The capacitance 
of the ROM address line Grom with respect to earth is represented by Crom. 
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For simplicity we assume this to be independent of vollage and to be deter- 
mined by the oxide thickness under Grom- 

With the aid of standard IGFET theory 5) it can be shown that for the 

IGFET M, 
N42 eg M* dv 
V = V; + (==) KV, + Vo)* — Vot] + : 


ox 


+ Vro; (B.1) 


V, is the gate voltage of M, (it is the voltage on the resistive gates at the 
point of overlap of the inversion layers); 

V, is the potential of the source of M, and thus the potential of the address 
line Grom of the ROM with respect to the substrate; 

Vro is the value of V, needed to turn on IGFET M, when J, is zero; 

N, is the doping level of the substrate of M,, which is assumed p-type; 

C, is the capacitance of the oxide per unit area under the gate of M, ; 

Vo = q N4 yo?|2es is the diffusion potential for the source and drain diodes 
of M, (abrupt junctions are assumed for simplicity); 

yo is the width of the foregoing space-charge regions for no voltage applied 
to the junctions; 

£g; is the dielectric constant of the silicon; 

Qiny is the mobile inversion charge per unit area (assumed homogeneous over 
the greater part of the channel) under the gate of M,. 

We assume for M, that +V > V,, so that the maximum value V,m which V, 

can attain for a fixed V, will be that corresponding to Q,,, being zero, so that 


V, = Vem y (Vem + Vo)? — Vot] + Fro (B.2) 
where 
m (q N42 es;)* 
EF wn 
The problem is to calculate the time t, needed to charge the capacitance Crom 
from zero volts to some voltage « V,m (x lies between 0 and 1). For the charg- 


ing of Crom we can write 


aon 


fat = J own dV,/T. (B.3) 


The IGFET M, operates in its saturation region so that we can write for the 
current J in it: 


T= - (V,— Vx)? (B4) 


where B = u Cox WJI; u is the mobility of the carriers responsible for the cur- 
rent Jin IGFET M,; W is the width of the conducting path of J; / is the 
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effective source-to-drain distance in M, and is thus 2Z. Vy is the value of 
V, needed to turn on IGFET M;: it depends on V, and eq. (B.2) yields 


Vr = V, + y KV, + Vo)* —Votl + Vro: 


(B.5) 


We substitute V4 from eq. (B.5) into eq. (B.4), substitute the resulting expres- 
sion for J into eq. (B.3) and after a certain amount of manipulation we obtain 


l 
4 Crom (y? = 4A)* 


——_—_— ——— 


B t. —y b+ (a Vem + Vo)* a + Vo? 
4Crom ie Vom + Vi b + 73) 
a (& Vem + Vo)? — Vot 
~ (9? —43) (a + Vo?) [a + (& Vom + Vo)? 
mm. (a Vom + Vo) — Vo 
— (y — 41) (b + Vo) [b + (a Vom + V9)*] 
where 
À =—V, + Vro — y Vo* — Vo, 
a=4[y—(y? —421], 
b =+ [y + (y? —42. 
In practice it turns out to be possible to write: 


B ta a (a Vm + Vo)? imm ^d 


4Cuow —— (9 — 44) (a+ Vot) [a + (x Vem + V3]. 


(B.6) 


(B.7) 
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